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A  policy  of  periodic  replacement  with  minimal  repair  at  failure  is  considered 
for  a  complex  system.  Under  such  a  policy  the  system  is  replaced  at  multiples 
of  some  period  T  while  minimal  repair  is  performed  at  any  intervening  system 
failures.  The  cost  of  a  minimal  repair  to  the  system  is  assumed  to  be  a  nonde¬ 
creasing  function  of  its  age.  A  simple  expression  is  derived  for  the  expected 
minimal  repair  cost  in  an  interval  in  terms  of  the  cost  function  and  the  failure 
rate  of  the  system.  Necessary  and  sufficient  conditions  for  the  existence  of  an 
optimal  replacement  interval  are  exhibited  in  the  case  where  the  system  life  dis¬ 
tribution  is  strictly  increasing  failure  rate  (IFR). 
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1.  INTRODUCTION 

A  complex  system  may  fail  if  one  of  its  many  components  ceases  to  function.  The  system 
is  returned  to  the  operating  state  when  the  failed  component  is  replaced.  As  the  great  majority 
of  components  have  not  been  replaced,  the  remaining  life  distribution  and  failure  rate  of  the 
system  are  essentially  undisturbed.  This  type  of  system  repair,  whereby  the  failure  rate  of  the 
system  is  not  altered  by  the  failure  and  subsequent  repair  of  the  system  is  known  as  minimal 
repair. 

A  policy  of  periodic  replacement  with  minimal  repair  at  failure  is  one  in  which  the  system 
is  replaced  at  multiples  of  some  period  T  while  performing  minimal  repair  at  any  intervening 
system  failures.  This  type  of  policy  was  introduced  and  investigated  by  Barlow  and  Hunter  [1] 
(see  also  Barlow  and  Proschan  [2]).  In  their  paper,  Barlow  and  Hunter  show  how  to  calculate 
the  optimal  period  T  assuming  the  cost  of  a  minimal  repair  is  constant  and  using  as  an  optimal¬ 
ity  criterion  the  minimization  of  total  expected  cost  per  unit  time  over  an  infinite  time  horizon. 
Tilquin  and  Cleroux  [8]  modify  this  model  by  introducing  to  the  cost  analysis  a  general  cost 
function  (to  account  for  depreciation  or  adjustment  costs,  interest  charges,  monitoring  costs  and 
the  like),  which  increases  continuously  with  the  length  of  time  the  system  is  in  use.  However, 
as  in  the  model  considered  by  Barlow  and  Hunter,  the  cost  of  a  minimal  repair  to  the  system  is 
constant  (and  hence,  for  example,  does  not  depend  on  the  time  at  which  the  repair  is  made  or 
the  previous  number  of  repairs  to  the  system).  Boland  and  Proschan  [3]  generalize  the 
Barlow-Hunter  model  to  incorporate  the  situation  when  the  cost  of  a  minimal  repair  is  an 
increasing  function  of  the  number  of  previous  repairs  to  the  system.  Further  generalizations  of 
the  Barlow  and  Hunter  model  can  be  found  in  the  work  of  Nakagawa  (see  for  example  Naka- 
gawa  (5)). 

•Research  supported  by  Air  Force  Office  of  Scientific  Research,  U.S.A.F.,  under  Contract  No.  F49620-82-K-0007. 
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In  this  paper,  it  is  assumed  that  the  cost  of  a  minimal  repair  to  the  system  which  fails  at 
age  /  is  C(r),  where  CO)  is  a  continuous  nondecreasing  function  of  /.  Hence,  as  the  system 
ages  it  becomes  more  expensive  to  perform  minimal  repair.  F  will  denote  the  life  distribution 
function  of  the  system  with  density  /  r(r)  will  denote  the  failure  rate  function  of  the  system, 
while  R(t)  *  J"Q  r(s)ds  is  the  cumulative  failure  rate  function  or  hazard  function  of  the  sys¬ 
tem.  It  is  assumed  that  F(0)  *  0  and  that  r(t)  is  a  positive  continuous  function  of  i.  The 
problem  of  finding  an  optimal  period  T0  for  periodic  replacement  (where  minimal  repair  is  per¬ 
formed  according  to  the  cost  function  CO)  on  system  failure)  is  investigated.  It  is  shown  that 
if  C(t)r(t )  is  nondecreasing  (in  particular  if  F  is  IFR),  then  an  optimal  period  T0  (possibly 
infinite)  exists. 


2.  EXPECTED  MINIMAL  REPAIR  COSTS 

Now  consider  the  situation  where  the  system  is  in  operation  in  the  time  interval  (0,  T). 
On  failure,  minimal  repair  is  made  (according  to  CO ))  and  we  assume  that  repair  time  is  negli¬ 
gible.  If  the  system  is  not  replaced  in  this  interval,  the  following  theorem  gives  two  useful 
expressions  for  the  expected  costs  of  minimal  repair  in  (0,  T). 

THEOREM  I:  The  expected  minimal  repair  cost  of  the  system  in  the  interval  [0,  T)  is 

J-R(T)  ,  rT 

„  CUTl(t))dt  -  f  C(t)r(t)dt. 

PROOF:  Let  NT  be  the  random  variable  denoting  the  number  of  minimal  repairs  per¬ 
formed  on  the  system  in  the  age  interval  [0,  T).  We  know  that  Ay  has  a  Poisson  distribution 
with  parameter  R(T )  (see  Barlow  and  Proschan  12],  pp.  96-97  and  Cox  and  Miller  [4],  p.  153). 

Now  if  Nt  «  k,  and  /]  ,  ...  ,  r*  are  the  times  of  the  minimal  repairs,  then  the  total 

minimal  repair  cost  in  the  interval  [0,  T)  is  CO,).  Given  Ay  “  k,  we  know  that  T]  =■ 

i-T 

/?0|),  ...  ,  r*  —  /?(/*)  are  distributed  as  the  order  statistics  of  a  random  sample  of  size  k 
from  the  uniform  distribution  on  ]0,  R(T))  (see,  for  example,  Parzen  [6],  pp.  139-143  or 
Thompson  [7]).  Hence,  the  expected  minimal  repair  cost  given  Ay  —  k  is 

£(CO|)  +  ...  +  CO*)  I  Ay  •  k ) 

-  e(c(r~'(tO  + ...  +  c(/r'(r*))|Ay-  k ) 

-  kE(C(R-'(r))\\r-  k) 

(where  r  is  uniformly  distributed  on  [0,  R(T ))) 

J.A(D  ,  i 

.  jjfy  « 

-  — T=r  ('<n  C<R-'Ui)it. 

R(T ) 
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Therefore,  the  expected  minimal  repair  cost  in  the  interval  [0,  T)  is 
E^iEiCitJ  +  ...  +  Citk)\NT  =  k)) 

‘  JJ T)  C'  CiR~Ht)U' 

1  r*<r> 

-  -s-rpr  L  CiR~'it))dt  [Ri D) 
RiT)  Jo 

J>R(T) 

CiR~'it))dt 

o 

J.  T 

„  C(w)r(w)dw 
o 


(using  the  change  of  variable  w  =  R  lit)). 


REMARK  1:  In  the  above  proof  of  Theorem  1,  the  number  of  minimal  repairs  to  the 
system  in  (0,  T)  is  a  nonhomogeneous  (or  homogeneous)  Poisson  process  with  intensity  func¬ 
tion  r(r).  In  particular,  given  that  the  system  is  functioning  at  age  /,  the  probability  of  a  sys¬ 
tem  failure  in  the  interval  it,  t  +  h)  is  of  th£  form  rit)h  +  oih).  Moreover,  the  mean 
number  of  failures  in  the  interval  [0,  T)  is  fQ  rU)dt  «•  R(T).  This  leads  us  to  interpret 
C(t)r(t)  in  a  naive  sort  of  way  as  the  "rate"  of  spending  a  dollar  on  minimal  repair  at  age  t. 
With  this  interpretation  f  C(t)r(t)dl  represents  the  mean  number  of  dollars  spent  on 
minimal  repair  in  (0,  T),  which  we  have  rigorously  demonstrated  in  Theorem  1. 

We  now  consider  some  examples  of  minimal  repair  cost  functions  and  determine  the 
resulting  expected  minimal  repair  cost  in  an  age  interval  [0,  T)  for  a  given  life  distribution.  It 
is  reasonable  to  assume  that  the  cost  of  a  minimal  repair  to  a  system  should  be  a  nondecreasing 
function  of  its  age.  Perhaps  even  more  appropriate  would  be  a  nondecreasing  function  of  the 

life  distribution  Fit)  or  equivalently  R  (t)  or  -=-  ■--. 

Fit ) 

EXAMPLE  1:  Let  Cit)  be  of  the  form  Cit)  **  giRit))  where  g  is  nondecreasing  and 
G'  —  g.  Then 

f  T  Cit)rit)dt  -  f  T  giRit)) dR it)  =  GiRit))  -  GiO). 

•J  0  0 


We  consider  the  following  particular  cases: 


I.  Ot  y> 

log  -=- —  and  f  Cit)rit)dt  * 

Fit)  *,° 

DOTIIT) 

c  - - —  When  a  —  l,  i.e.,  minimal  repair  cost  is  proportional  to  the  cumula- 

o  +  l 

tive  failure  rate  function,  then  the  expected  minimal  repair  cost  in  (0,  T)  is 
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fill  T\ 

c  — - — .  When  a  =  0,  one  obtains  the  cost  function  used  by  Barlow  and 
Hunter  111. 


(b)  giy)  *•  ce“y  (a  >  0).  Then  Cit) 

Cit)r{t)dt  -  -  ieaRm  -  1)  -  — 
a  a 


g(R  (/)) 
1 


ce 


aR(  l) 


1 


and  So 


-  1 


( F(t))a 

.  When  a  *  1,  i.e.,  the  cost  of 


F°(T) 

a  minimal  repair  is  inversely  proportional  to  the  survival  probability,  then  the 
expected  minimal  repair  cost  in  [0,  T)  is  proportional  to  the  odds  ratio 
F(TVFiT). 

The  linear  combinations  of  (a)  and  (b)  constitute  a  large  class  of  reasonable  cost  functions  for 
minimal  repair. 

EXAMPLE  2:  One  might  consider  a  cost  function  of  the  type  Cit )  =  cf  (a  ^  0).  In 
the  particular  case  when  a  =■=  1  this  yields  an  expected  minimal  repair  cost  in  [0,  T)  of 

f  clr(t)di  —  cR{T)T  -  f  cR  (t)dt. 

Jo  Jo 

If  the  life  distribution  is  Weibull  of  the  form  Fit)  =  /-  e~(kl)l>,  then  the  expected  cost  is 

cikT^T-  fT c(\t)^dt  ^  ckRP+'\-^-r\. 

Jo  (/3  +  1  ) 


3.  PERIODIC  REPLACEMENT  WITH  MINIMAL  REPAIR 

Let  us  now  consider  the  problem  of  finding  a  period  7o  for  replacement  which  minimizes 
expected  long-run  cost  per  unit  of  time.  Co  will  denote  the  cost  of  a  planned  system  replace¬ 
ment.  If  ¥(T)  represents  the  expected  long-run  cost  per  unit  of  time  when  the  system  is 
periodically  replaced  at  times  T,  2T,  3T,  . . .  ,  then  ti T)  has  the  form 

fn  C(t)r(t)dt  +  c0 
-  — - - - 


Therefore, 


ViT) 


C(T)r(T)T- fj  C(t)r(t)dt  -  cQ 
_ 


and  this  yields 

THEOREM  2:  If  Cit)r(t)  is  a  nondecreasing  function  of  t,  then  an  optimal  replacement 
interval  T0  exists.  T0  is  finite  if 

lim  f  T  [CiT)r(T)  -  Cit)rit))dt  >  c0 

T—m  JO 
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or  if  C it)rit)  is  eventually  constant  and 

lim  f  [CiT)riT)  -  Cit)rit)\dt  >  c0, 

7— K»  0 


otherwise  the  optimal  policy  is  to  never  replace. 

/.  T 

CO) /•(/)<*  is  nondecreas¬ 
ing  and  hence  C'i  T)  has  a  zero  iff 

lim  f  (C(7)r(7)  -  Cit)rit)]dt  >  c0 

T—<*>  •'O 


or 

lim  ClC(T)r(T)  -  Cit)rit))dt  >  c0 

7—oo  *  0 

if  CiT)riT)  is  eventually  constant. 


REMARK  2:  If  F  is  an  IFR  distribution  with  strictly  increasing  failure  rate  function  rit), 
then  there  exists  a  unique  optimal  replacement  interval  T0  (possibly  infinite). 


EXAMPLE  3:  Let  us  consider  Cit)  of  the  form  CO)  *  ceaR(,)  (see  Example  1(b)):  if  F 
is  Weibull  of  the  form  Fit)  “  1  -  e~(K,)0  where  0  >  1,  then  the  optimal  period  T0  must 
satisfy 


ce 


>urn)0 


/ax  ng 


If  a  =  0,  then  T0 


± 

A 


l  03  -  l)c) 


l/R 


EXAMPLE  4:  Let  CO )  be  of  the  form  Cit )  —  c(Rit))a  and  7  be  Weibull  where  Fit ) 
»  1  -  e~ik,) Then  Cit)rit)  —  c0\i\t)aP+R~i.  This  is  increasing  iff  <*0  +  0  >  1,  and 
hence,  in  particular,  if  0  >  1  (or  for  example  0  —  1/2  iF  is  DFR)  and  a  “  3/2)  there  is  a 
unique  optimal  replacement  interval  7<j. 


REMARK  3:  We  may  also  wish  to  consider  the  problem  of  finding  a  replacement  interval 
7o  which  minimizes  total  expected  costs  over  a  finite  time  horizon  [0,  t0).  For  period  7,  where 
mT  <  to  <  im  +  1)7  for  some  integer  m,  we  see  that  the  total  expected  cost  in  10,  /0)  is 

/»  T  ft 

0  Cit)rit)dt  +  JQ  Cit)rit)dt. 


If  Cit)rit)  is  an  increasing  function  of  t,  it  follows  that  ^f,#’(7)  —  m[CiT)riT)  — 


h 


ro 


Citg  —  mT)rit0-  mT) 1  >  0  in 
optimal  replacement  interval  70  is  an  element  in  the  set  { r0.  to/2,  to/3.  . . .  ). 


,  ,  ,  —  .  Therefore,  if  Cit)rit)  is  increasing  then  the 
m  +  l  m 


— _ M 
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j  ABSTRACT 

In  this  paper  formulas  are  derived  for  the  reliability  of  a  single  unit  to 
which  identical  spares  in  standby  are  allocated,  with  all  of  these  units  having  a 
hyper-exponential  or  Erlang  distributed  lifetime.  Two  advantages  are  obtained 
by  using  these  distributions.  First,  the  general  procedure  for  calculating  the 
effect  of  redundancy  is  applicable,  in  contrast  with  most  of  the  common  life  dis- 
|  tributions,  such  as  Weibull.  Moreover,  both  distributions  are  suitable  for 

matching  to  observed  curves  and  practical  data  by  varying  the  values  of  the 
parameters,  covering  together  most  of  the  cases  of  practical  interest. 


I.  INTRODUCTION 

A  complex  system  made  up  of  several  subsystems  in  series,  cannot  successfully  perform 
its  function,  unless  every  subsystem  remains  in  operation.  In  case  of  malfunctioning  of  one  of 
them,  a  spare  must  be  present  in  order  to  replace  the  failing  part  on-line.  If  the  needed  spare 
part  is  not  available  the  series-system  ia  out  of  service,  while  the  remaining  spares  for  the  other 
subsystems  are  of  little  use  for  the  moment. 

To  increase  the  reliability  of  temporarily  isolated  systems— e  g.,  aboard  a  ship  on  route- 
spare  parts  must  be  taken  along  in  a  sufficient  number.  A  shortage  during  this  period  of  isola¬ 
tion  not  only  decreases  the  availability  of  the  system  concerned  but  also  entails  an  enormous 
rise  in  the  costs  of  maintenance,  if  this  component  has  to  be  supplied  on  an  emergency  basis,  if 
possible  at  all. 

In  former  days  it  was  possible  to  provide  the  system  with  an  abundant  stock  of  spares.  If 
there  were  no  budgetary,  spatial  or  other  limitations  the  stocks  taken  along  could  be  extended 
to  such  a  high  level  that  for  a  certain  period  there  was  practically  no  shortage  of  any  spare. 
Nowadays,  however,  due  to  increased  complexity  and  consequently  high  prices  of  modern  sub¬ 
systems,  the  costs  of  such  a  stock  would  be  prohibitive. 

A  procedure  has  been  developed  based  on  dynamic  programming  by  means  of  which  the 
composition  of  an  optimal  kit  can  be  calculated  (9).  The  basic  idea  is  that  the  chance  of  a  shor¬ 
tage  of  any  spare  and  a  simultaneous  surplus  of  the  other  ones  is  as  small  as  possible.  An 
optimal  kit,  of  all  kits  of  the  same  cost,  guarantees  the  highest  reliability  of  the  series-system  to 
which  the  kit  is  joined. 
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The  spare  part  provisioning  in  behalf  of  modern  systems  is  a  logistics  problem,  which  can¬ 
not  be  solved  satisfactorily  only  by  best  engineering  judgement  based  upon  intuition  and  experi¬ 
ence,  as  happened  formerly.  A  numerical  approach  must  be  made  at  present  with  the  aid  of  the 
theory  of  reliability,  using  formulas  to  determine  the  chance  of  survival  of  a  unit  to  which  one 
or  more  units  in  standby  are  allocated. 

Many  papers  in  this  field  of  investigation  assume  an  exponentially  distributed  lifetime  of 
the  units  and  spare  parts  concerned.  This  distribution  is  a  one  parameter  distribution  and  com¬ 
pletely  defined  when  the  mean  lifetime  is  specified.  Characteristic  is  the  constant  failure  rate 
which  means  that  the  probability  of  failure  is  independent  of  age:  the  unit  is  as  good  as  new  at 
any  moment  of  its  lifetime.  A  certain  justification  for  assuming  an  exponentially  distributed 
lifetime  is  given  by  Palm’s  theorem  [5]  and,  of  course,  by  so  many  units  actually  showing  an 
exponentially  distributed  lifetime.  On  the  other  hand  the  exponential  distribution  is  not  flexible 
enough  to  describe  the  survival  curve  of  many  other  components  or  systems.  Therefore,  other 
failure  distributions  (such  as  Weibull,  gamma,  etc.)  are  proposed  [7,3,4],  These  distributions 
are  characterized  by  more  than  one  parameter  and  so  are  more  suited  for  fitting  to  observed 
reliability  curves  in  practice. 

A  drawback  of  most  of  the  nonexponential  distributions  is  that  it  is  difficult,  if  not  impos¬ 
sible,  to  obtain  a  closed  form  expression  for  the  effect  of  redundancy  by  using  the  general  pro¬ 
cedure  to  derive  an  expression  for  the  reliability  of  a  unit  with  one  or  more  spares.  In  this 
paper  two  generalized  exponential  distributions  are  studied,  together  covering  most  of  the  cases 
of  practical  interest;  for  both  distributions  a  formula  for  the  reliability  with  spares  is  derived. 

2.  RELIABILITY  OF  A  SINGLE  UNIT  WITH  K  SPARES 

In  this  section  the  reliability  of  a  single  unit  together  with  k  (identical)  spares  is  derived. 
The  procedure  holds  for  every  life  distribution  [2,6], 

Figure  1  shows  a  single  unit  (equipment,  component,  system)  on-line,  to  which  k  identi¬ 
cal  spares  are  allocated,  k  =  0,  1,  2,  . . .  .  When  unit  1  fails,  unit  2  in  standby  is  put  in  opera¬ 
tion,  while  unit  1  is  taken  off-line.  This  process  will  be  repeated  until  the  kit  of  k  spares  is 
exhausted. 


1 


CZD - o 

2 


Figcxe  1.  Unit  1  on  tine,  k  spares  allocated. 


Let  X,  be  the  lifetime  of  the  /  ,.i  unit,  /  =■  1,  ...  ,  k  +  1.  Then  the  lifetime  Yk+i  of  the 
whole  system  consisting  of  one  unit  with  k  spares  may  be  written  as 
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A  i-l 

=  i  a', 

/-i 

Lei  ail )  be  the  same  probability  density  function  (p.d.f.)  for  all  .Y,,  fit)  the  p.d.f.  of  Yk  +  \. 
Then  renewal  theory  states  the  following  relationship  between  the  two  functions  /and  a: 

Lf  =  ( La)k 

where  Li;  is  the  Laplace  transform  of  a  function  Using  the  inverse  transformation  one  finds 
the  function  / 

Finally,  we  can  find  Rk+\it)  by  integration  of/-/): 

(2.1)  /?^i(r)  =  J°°  fiaidu.  i  $s  0. 

This  general  procedure,  if  applied  to  the  exponential  distribution  air)  =  Ke~kl,  gives  the  well- 
known  results: 

x*+» 

fit)  =  — —  ike~K' 

(2.2)  A- =  0.1 . 

k  (\i)‘ 

**-,<'>  =  e~K'  1 

i-o  '• 


3.  THE  HYPER-EXPONENTIAL  AND  ERLANG  DISTRIBUTIONS 


As  mentioned  in  the  introduction,  most  of  the  common  life  distributions  unfortunately 
are  unsuitable  for  the  application  of  the  general  procedure  described  in  Section  2.  In  most 
cases  it  is  difficult  or  even  impossible  to  calculate  the  Laplace  transform  of  the  density  function 
ail).  Even  if  the  transform  of  ait)  can  be  calculated,  the  expression  is  so  complex  that  calcu¬ 
lation  of  the  inverse  transform  is  difficult.  Although  in  many  cases  numerical  methods  are 
available  for  common  distributions,  such  as  the  Weibull,  closed  form  expressions  for  the  effect 
of  spare  parts  in  standby  are  not  yet  known.  It  must  be  emphasized  that  this  lack  of  knowledge 
of  a  closed  form  expression  must  be  seen  as  a  more  and  more  disturbing  drawback,  as  the 
redundancy  is  of  increasing  importance  for  an  optimal  spare  part  policy.  Now  it  seems  that  two 
distributions  are  available  which  combine  the  two  operational  advantages,  namely,  the  suitabil¬ 
ity  both  for  curve  fitting  and  for  application  of  the  procedure  in  Section  2.  These  two  distribu¬ 
tions  expressed  by  their  reliability  functions  are,  respectively: 


1.  The  hyper-exponential  distribution: 

(3.1)  /?,(/)  =  u>e-*»'+  (1  -  «#)*-"-»>*•' 


0  ^  OJ  ^  1 
0  <  8  <  1 


H  >  0 

2.  The  Erlang  distribution: 

(3.2)  /?,(/>  =  <•-**'  £  L~  1.2 .  A  >  0. 

Tn  1  ■ 

Both  distributions  can  be  seen  as  generalizations  of  the  exponential  distribution:  the  hyper¬ 
exponential  is  exponential  if  a>  =  0,  to  =  1  or  8  =  1/2  while  the  Erlang  is  exponential  in  case 
L  =  1. 
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lo  compare  these  distributions  we  first  give  the  results  for  mean  and  variance.  »u  is  the 
mean  of  the  hyper-exponential  distribution. 


Mean 

Variance 

Hyper-exponential 

01  |  1  —  OJ 

M 

j  (  2o»(I  -  u>)(l  -  28 ) 2 

8  1-8 

(u>  —  2o>8  +  8)2 

Erlang 

X 

1 

Lk 2 

Characteristic  is  that  the  standard  deviation  is  always  less  than  or  equal  to  the  mean  in  case  of 
an  Erlang  distribution  where  the  opposite  is  true  for  the  hyper-exponential  distribution. 


The  hazard  function  of  the  hyper-exponential  distribution  is  given  by 


(3.3) 


8oj  +  ( 1  —  6)(1  -  tp)e-(|-28^' 
<o  +  (1  —  at) 


Simple  analysis  gives  that  z  is  a  monotonic  decreasing  function  and 
lim  z(t)  =  8 fi. 

/— oo 

In  Figure  2  some  graphs  of  the  hazard  function  are  given  for  particular  values  of  the  parameters 
(resulting  in  the  same  mean  value). 


Figure  2.  Graphs  of  (he  hazard  function  for  the  hyperexponential  distribution. 
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>>  1 


Will)  (lie  hyper-exponential  distribution  we  .ire  able  l<>  describe  .1  high  I'.nliire  rale  m  the 
first  pari  of  ihe  lifetime  of"  a  unit  After  a  ''hum  in"  period  ihe  failure  is  appro\imaiely  eon- 
slant 


I  lie  hazard  fimelion  of  ihe  lilang  dislrilunion  is  an  increasing  luneiion  of  lime  111  and 
lim  el  1)  -  l.K  in  1  h is  ease  So  those  two  distributions  form  a  group  whose  standard  deviations 

cover  all  values  from  zero  to  infinity  and  die  same  is  true  for  their  hazard  funetions  I  hose  dis¬ 
tributions  have  proven  10  represent  adequately  many  real  lifetimes  of  systems  and  units  |(>| 
figure  .?  shows  the  reliability  futielion  of  both  1  rlang  and  hyper-exponential  distributions, 
where  time  is  normalized  with  respeet  to  the  mean  lifetime 


I  *  l<  i  1  iri  }  Irliing  and  hypcicxponcnii.il  vlisinhuiions 


4.  THE  RELIABILITY  Rk , , <l )  IN  CASK  OF  THE  II  YPKR-KXI’ONKN TIAI. 
DISTRIBUTION 

Ihe  probability  density  function  <1(1)  of  Ihe  hyper-exponential  distribution  is  given  by 
r id)  fi/u.M ■  t  (I  f> )/<(!  »,»)<-  " 

If  ft  ~  1/2  or  01  -  0  or  ( 1 >  I,  </(r)  becomes  the  density  function  of  an  exponential  distribu¬ 
tion.  Results  for  being  known  m  this  ease,  we  only  study  Ihe  other  values  of  6  ami  <«. 


VOI  2 <),  NO  4.  1)1  (  I  Mill  U  I'lx; 
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We  write  for  ait)  the  somewhat  shorter  notation 

ait)  *■  ce~al  4-  de~b'  with  a  ■*  8(i,  b  *■  (1  —  8)fi, 

c  —  8/iui,  </—(!  —  6)/a(l  —  to). 


c  +  d 


s  +  a  s  +  b 


Taking  the  Laplace  transform  of  ait): 

La  is) 
so 

L/is) 


fk  + 1) 

c 

k+\~)  J 

a 

l  j  1 

s  +  a 

s  +  b 

Making  some  preparations  for  (he  inverse  transformation,  yields: 


vw-!(7 


Ba 


with 


A„  - 


K 

By  +  X  -  d*+' 


1 

T  l‘  T  '1 

fk  -«| 

ia  -  b)*+|-“ 

-  c*+l. 

l/-  l) 

1 

*  V  ik  +  ii 

(k  -  P\i 

ib  -  a)*+1-* 

Si') 

(/-  ll' 

Actually,  taking  the  inverse  Laplace  transform,  we  find 

*+i  A  *+i  A. 

/(/)  .  y  - 2 -  ,a-l  -m  +  y  - - ,0-1  g~K 

«-t  («-  1>!  ft  o-  D! 


k, 


k. 


Integrating  fit)  we  find  the  reliability  ^*+i  (/)  of  a  single  unit  with  k-spares  in  case  of  a  hyper¬ 
exponential  failure  density  function  per  unit: 


(4.1) 


Application  and  combination  of  the  above  mentioned  results  yields  the  following  formula  for 

Rk ♦,</)•• 


(4.2) 


Ry  +  \it) 
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with 


cy-w*+,|l+  X 

i-j+ 1 


28  -  1 


k+\—i 


dj-  (1  -o>)*+,{l  +  X 

i-j  +  l 


-  8)(1  -  to) 


j  -  0,  1,  ...  .  k  —  1. 


ck  -  tok+\  dk  -  (1  -  a>)k+l. 


>  • 


I  -• 


5.  THE  RELIABILITY  Rk+,(t)  IN  CASE  OF  THE  ERLANG  DISTRIBUTION 


Differentiation  of  the  Erlang  reliability  function  with  parameters  L  and  X  yields  the  fol¬ 
lowing  expression  for  the  probability  density  function  of  the  lifetime  of  a  single  unit: 

Uk)L 


ait) 


iL  ~  D! 


,L~\e- 


Lki 


Taking  the  Laplace  transform  gives 


Lais) 


iLk)1 
( s  +  Lk)L  ' 


hence 


lf(  t  iLk)'k+"L 
7  ”  (s  +  Lx)(*+I)i 


We  must  take  the  inverse  transform  of  the  expression  for  fit): 


fit) 


Uk)ik+'u 

u*  +  in  -  in 


,(*+l)L-l  e-Lkl 


> 


I  *•>', 


I 


I 


again  a  p.d.f.  of  an  Erlang  distribution  with  parameters  (X  +  1)L  and  — — -  Thus,  we  find 

k  +  1 

for  the  reliability  R*+i(f)  of  a  single  unit  which  k  spare  parts  in  case  of  the  Erlang  distribution: 


(5.1) 


R*+i(r) 


e-U,  £ 

;-o 


iLktV 

/! 


6.  A  NUMERICAL  EXAMPLE 

To  applicate  the  formula  of  Section  4  we  study  a  system  of  units  where  all  the  units  have 
a  hyper-exponentially  distributed  lifetime.  We  assume  for  the  parameters  the  following  values: 
to  *■  0.7,  8  =  0.01,  and  n  =  50.  This  means  that  each  unit  has  a  mean  lifetime  of  1.4  with  a 
standard  deviation  of  1.9. 
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If  we  have  to  allocate  spares  to  the  system  so  that  a  99%  reliability  exists  at  t  —  10  we  see 
from  Figure  4  that  18  spares  are  sufficient.  If  we  base  computations  on  the  exponential  distri¬ 
bution  with  the  same  mean  lifetime  of  1.4  we  find  that  14  spares  already  give  the  same  reliabil¬ 
ity  at  t  —  10.  And  if  14  spares  are  allocated  to  this  system  where  the  units  have  an  hyper¬ 
exponential  distribution  only  a  96%  reliability  at  t  *  10  can  be  given.  Generally,  the  hyper¬ 
exponential  distribution  is  more  conservative  in  estimating  number  of  spares  than  the  exponen¬ 
tial  distribution  with  the  same  mean. 


Figure  4.  Graphs  of  R*+)(r)  in  case  of  hyperexponential  distribution  with  u  —  .7  6  —  .01  and  p.  —  50. 
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ABSTRACT 

In  a  recent  paper,  Kent  and  Quesenberry  119)  considered  using  certain  op¬ 
timal  invariant  statistics  to  select  the  best  fitting  member  of  a  collection  of  pro¬ 
bability  distributions  using  complete  samples  of  life  data.  In  the  present  work 
extensions  of  this  approach  in  two  directions  are  given.  First,  selection  for 
complete  samples  based  on  scale  and  shape  invariant  statistics  is  considered. 

Next,  the  selection  problem  for  type  I  censored  samples  is  considered,  and  both 
scale  invariant  and  maximum  likelihood  selection  procedures  are  studied.  The 
two-parameter  (scale  and  shape)  Weibull,  lognormal,  and  gamma  distributions 
are  considered  and  applications  to  real  data  are  given.  Results  from  a  (small) 
comparative  simulation  study  are  presented. 

1.  INTRODUCTION 

The  two-parameter  (scale  and  shape)  Weibull,  lognormal,  and  gamma  distributions  are  all 
commonly  used  in  reliability  and  life  testing  problems.  The  problem  of  selecting  one  of  these 
three  distributions  for  a  particular  sample,  either  complete  or  censored,  is  a  difficult  one.  In 
this  work  we  consider  basing  the  selection  on  the  values  of  certain  selection  statistics  computed 
from  the  sample.  Although  we  consider  selection  based  only  on  sample  information,  it  should 
be  noted  that  in  some  practical  problems  further  information  may  be  available  which  should 
also  be  weighed  in  the  final  selection  of  a  distribution.  Such  information  could  be  derived,  for 
example,  from  known  physical  characteristics  of  a  failure  mechanism  and  its  failure  rate. 

Also,  it  should  be  mentioned  that  throughout  this  paper  we  treat  the  selection  of  one  of 
the  distributions  as  a  forced  selection  problem.  That  is,  we  formally  select  exactly  one  of  the 
competing  distributions.  In  practice,  we  may  sometimes  wish  to  use  a  partial  selection  pro¬ 
cedure  which  does  not  necessarily  always  select  one  distribution  over  the  others  when  they  are 
close  together,  as  indicated  by  selection  statistics  that  are  nearly  equal.  The  selection  statistics 
posed  here  may  be  readily  used  in  partial  selection  schemes.  However,  we  do  not  explicitly 
consider  these  applications  in  the  present  paper. 

Kent  and  Quesenberry  [19],  KQ,  proposed  a  forced  selection  procedure  based  on  statistics 
that  are  invariant  under  scale  transformations.  Other  relevant  literature  includes  a  paper  by 
Dumonceaux,  Antle,  and  Haas  [11]  who  examined  maximum  likelihood  ratio  (MLR)  test1-  for 
discriminating  between  two  models  with  unknown  location  and  scale  parameters,  and  compared 
empirically  the  power  of  MLR  tests  with  that  of  uniformly  most  powerful  invariant  (UMP1) 
tests  for  discriminating  between  normal  and  Cauchy  distributions.  They  actually  recommend 
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the  MLR  test  over  the  UMPI  test  on  the  basis  of  relatively  good  power  and  ease  of  computa¬ 
tion.  Dumonceaux  and  Antle  [10]  gave  an  MLR  procedure  for  discriminating  between  Weibull 
and  lognormal  distributions  that  is  based  on  the  fact  that  the  logarithms  of  both  Weibull  and 
lognormal  random  variables  have  location-scale  parameter  distributions.  In  a  recent  paper,  Bain 
and  Engelhard!  [2]  considered  a  likelihood  ratio  selection  statistic  for  selecting  between  gamma 
and  Weibull  distributions. 

Some  graphical  procedures  for  the  selection  problem  have  been  given  by  Nelson  (2 1 ),  and 
by  Barlow  and  Campo  [.1].  Other  papers  that  are  related  to  the  present  work  include  Hogg, 
Uthoff,  Randles,  and  Davenport  [18],  who  discuss  a  number  of  selection  procedures,  including 
one  based  on  location-scale  invariant  statistics;  Dyer  {[12], [13])  who  considers  a  number  of 
selection  procedures  for  discriminating  between  pairs  of  classes  of  location-scale  distributions; 
and  Uthoflf  ([24], [25])  who  considers  some  particular  invariant  statistics.  As  general  references 
for  invariant  tests  see  Hajek  and  Sidak  [15]  and  Lehmann  [20],  and  for  MLR  tests  see  Cox  [8]. 
Volodin  [26]  considers  a  generalized  three-parameter  gamma  distribution  and  discriminates 
between  two-parameter  gamma  and  Weibull  distributions  by  making  scale  invariant  tests  on  the 
other  parameters. 

As  mentioned  above,  KQ  considered  selecting  among  the  gamma,  lognormal  and  Weibull 
families  for  the  complete  sample  problem.  The  selection  statistic  posed  was  formed  by  first 
deriving  a  scale  invariant  statistic  that  is  optimal  in  the  sense  that  it  minimizes  the  sum  of  the 
two  probabilities  of  selecting  the  incorrect  distribution  for  two  conformable  distributions  (cf. 
Quesenberry  and  Starbuck  [22]);  and  then  replacing  the  shape  parameter  by  its  ML  estimator. 
Such  procedures  were  called  suboptimal  and  the  selection  statistics  for  the  three  families  were 
set  out  in  simple  closed  form  formulas  in  that  paper. 

In  the  present  work  we  consider  two  major  changes  in  the  approach  and  problem  con¬ 
sidered  in  KQ.  First,  for  the  lognormal  and  Weibull  families  we  use  optimal  scale  and  shape 
invariant  selection  statistics.  Also,  we  consider  the  selection  problem  for  type  I  censored  sam¬ 
ples  as  well  as  for  complete  samples.  For  these  cases  the  selection  statistics  are  generally 
expressed  as  definite  integrals  whose  evaluation  requires  numerical  integration.  Thus,  a  sub¬ 
stantial  part  of  this  work  has  necessarily  been  concerned  with  the  development  of  computer 
algorithms  to  evaluate  these  integrals. 

2.  DENSITIES  AND  SELECTION  FUNCTIONS 

In  many  applied  problems  it  is  reasonable  to  assume  that  the  location  parameters  of  life 
distributions  are  known.  Thus,  we  consider  distributions  with  only  scale  and  shape  parameters 
unknown.  The  densities  of  the  gamma,  lognormal  and  Weibull  distributions  to  be  studied  are 
given  in  Table  1.  The  symbol  liab)(x)  in  Table  1,  and  elsewhere,  is  the  indicator  function  of 
the  interval  (a,  6),  i.e.,  Iiabt(x)  =  1  if  a  <  x  <  b,  and  is  zero,  otherwise. 

We  consider  type  I  censored  samples,  which  are  obtained  when  a  number  of  items  are  put 
on  life  test  and  observed  for  a  previously  specified  fixed  lime  T.  Thus,  the  parent  density  for 
the  observed  lives  is  a  truncated  version  of  the  complete  samples  density  given  in  Table  1. 

The  approach  used  here  is  of  the  same  general  form  as  that  in  KQ.  A  selection  statistic,  S, 
is  defined  for  each  of  the  three  parametric  classes,  and  the  class  with  the  largest  selection  statis¬ 
tic  is  chosen  as  the  best  fitting  family  for  a  given  sample. 
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TABLE  1  —  Densities  of  Weibuli  Lognormal,  and  Gamma  Distributions 


Name 

Symbol 

Density 

Weibuli 

mo,  p) 

'■  -  f 

A 

a  i 

exp  |-(.v/«)',l  ■  /,(,  *,<  v). 

u 

Lognormal 

LXlo.  it) 

—  _ 

i 

— -  •  exp{-(ln<  v/w)|:/2<r’]  ■  /„)oo,(.v); 

0 

Ga  mma 

GUI.  a) 

v  2 

u  =  a " 

7 T  <r.v 

[!<«)]  Lx"  'exp(-.v/w)  ■  /,(lx|l.v); 

0 

We  consider  some  transformation  properties  of  these  distributions  before  deti 
lion  statistics  for  them.  If  X  is  a  random  variable  with  either  a  GIO.  a).  T/V(«,  <r)  m 
distribution,  then  consider  the  transformation 

(2.1)  Y  =  aX'\  a  >  0.  h  >  0 

If  A-  is  a  W(d,  fi )  random  variable,  then  Y  is  a  IV(a9h,  i/b)  random  variable,  amt 
LN(t f  <r )  random  variable,  then  Y  is  a  LNiaOb,  ber )  random  variable.  Thus,  Weibuli 
variables  are  transformed  to  Weibuli  random  variables  by  (2.1),  and  lognormal  rand< 
ables  are  transformed  to  lognormal  random  variables  by  this  transformation.  Unfortuna 
present  purposes  at  least,  the  gamma  distributions  do  not  share  this  property  since  a 
random  variable  is  not  always  transformed  to  another  gamma  random  variable  by  (2  I  > 
to  say,  the  G(0,  a)  class  is  not  a  scale-shape  class  that  is  conformable  with  the  lognnt 
Weibuli  classes  as  defined  by  Quesenberry  and  Starbuck  [22].  Nevertheless,  we  use 
selection  statistic  for  the  complete  samples  problem  which  is  essentially  the  value  of  the 
function  of  a  maximal  invariant  when  each  of  the  three  parents  is  assumed. 


For  .V) . ,v„  an  observed  sample,  we  define  the  selection  statistic  for  a  den' 

tion  f  (/'  =  1.  2,  3)  by 

(2.2)  S,  =  /o“/o”./;,y.vf . yv,* )y"  V'^Lv,  ...xn)KdydK. 


Due  to  the  property  of  the  Gin,  a)  distribution  discussed  above,  the  selection  sta"' 

(2.2)  is  a  function  of  the  parameter  «  We  obtain  a  selection  statistic  by  replacing 
maximum  likelihood  estimator,  a,  in  this  function.  The  selection  statistics  used  in  ti 
are  given  in  Table  2. 


TABLE  2  —  Scale-shape  Invariant  Selection  Statistics  for  Complete  Samples 


Family 

i 

s, 

W(9,  fi) 

i 

T(n)  /o°°  (Ux,)*ax,x)  "\"-2</X 

LN(e,  <r ) 

2 

1/2  n-i/27r-'«,'*-"r|l/2(o  -  1)][I  ln2x,  -  (I  In  xf)2/nl,/2' 

G(e,  a) 

3 

rinair-^a)  (  fi  x,)“A(I x,x)-"“\"-2</A 

•^0 
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The  evaluation  of  these  functions  requires  numerical  integration,  except  for  the  lognormal 
selection  function.  It  is  often  computationally  easier  to  compute  and  compare  the  logarithms  of  I 

the  selection  statistics  than  the  statistics  themselves.  To  estimate  the  parameter  a  of  the 
gamma  distribution,  we  use  the  ML  estimator  a  of  Greenwood  and  Durand  114)  which  has 
been  studied  further  by  Bowman  and  Shenton  [6],  and  was  recently  used  and  given  in  detail  by 
KQ.  The  selection  procedures  proposed  here  are  closely  related  to  uniformly  most  powerful 
invariant  (UMPI)  tests  for  separate  families  testing  problems.  For  the  particular  case  of  select¬ 
ing  between  lognormal  and  Weibull  distributions,  using  the  selection  statistics  of  Table  2  is  I 

equivalent  to  using  the  UMPI  test  statistic  for  classifying  a  sample  into  one  of  these  two  distri¬ 
butions.  If  <*i  denotes  the  probability  that  a  sample  from  a  lognormal  parent  will  be  classified 
as  a  Weibull  sample,  and  a 2  the  probability  that  a  Weibull  sample  will  be  classified  as  a  lognor¬ 
mal  sample,  then  the  above  selection  procedure  will  minimize  +  a2  among  all  procedures 
invariant  with  respect  to  the  transformations  of  (2.1).  Or,  if  the  probability  of  selecting  each 
distribution  is  1/2,  then  this  procedure  minimizes  the  total  probability  of  niisclassification,  viz.,  I 

(a\  +  a2)/ 2  (see  KQ,  Sec, ion  3). 

3.  SIMULATION  RESULTS  FOR  COMPLETE  SAMPLES 

In  this  section  we  report  results  of  a  simulation  study  of  the  selection  rules  proposed 
above.  ) 

In  order  to  obtain  results  that  can  be  compared  with  those  of  KQ,  samples  were  generated 
from  nine  parent  distributions:  W(l,  1/2),  W(l,  2),  IV(1,  4),  LN (1,  0.4),  LN(l,  1), 

LN (1,  2.5),  GO,  1/2),  GO,  2),  and  GO,  5)  for  n  =  10,  20,  30.  One  thousand  samples 

were  generated  from  each  of  these  distributions.  The  pairwise  selection  error  rates  are  given  in 

Table  3  and  the  observed  rates  of  correct  classification  in  the  3-way  procedure  are  given  in  9 

Table  5. 

The  entries  in  Table  3  are  read  as  follows,  using  the  first  set  of  results  as  an  example. 

The  selection  is  to  be  made  between  W  and  LN  families.  One  thousand  samples  of  size  10 

were  generated  from  a  W  distribution,  and  28  percent  of  these  samples  were  classified  as  being 

from  an  LN  distribution.  One  thousand  samples  of  size  10  were  generated  from  an  LN  distri-  ft 

bution  and  36  percent  of  these  samples  were  classified  as  being  from  a  W  distribution.  Note 

that  since  W  and  LN  are  conformable  scale-shape  families,  these  results  do  not  depend  upon 

which  particular  members  of  the  families  that  are  involved. 

This  procedure  has  total  error  probabilities  for  the  case  of  a  lognormal  vs  a  Weibull  that 
are  the  smallest  possible  for  a  scale-shape  invariant  procedure.  Comparison  of  the  results  for  p 

this  case  given  in  Table  3  and  those  in  Table  4  of  KQ  shows  that  the  present  optimal  procedure 
has  very  little  advantage  over  the  suboptimal  procedure  of  KQ.  For  the  other  two-way  selection 
problems,  gamma  vs  lognormal  and  gamma  vs  Weibull.  the  comparisons  of  the  procedures  of 
this  paper  with  those  of  KQ  are  not  clearcut.  This  is  because  the  gamma  distribution  does  not 
admit  a  UMPI  statistic  with  respect  to  the  transformations  of  (2.1). 

i 

In  view  of  these  observations  we  recommend  the  selection  procedures  set  out  in  KQ  on 
the  grounds  that  (i)  the  selection  statistics  in  Table  2  of  KQ  have  convenient  formulas  that  are 
readily  evaluated,  and  (ii)  the  error  rates  achieved  by  those  procedures  appear  to  be  about  as 
favorable  as  for  those  achieved  by  the  much  more  computationally  difficult  scale-shape  invari¬ 
ant  procedures. 
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TABLE  3  —  Selection  Error  Rates  for  Pairwise  Procedures* 


n 

X  ~  w 

X  -  LN 

Total 

10 

.36 

.28 

.32 

20 

.21 

.23 

.22 

30 

.15 

.18 

.16 

n  X-~G(\/2) 

X  W(\/2) 

Total 

X  ~  G(  1/2) 

X~  (V(2) 

Total 

.40 

.37 

.34 


X  -  (7(1/2) 


.40 

.37 

.34 


.42 

.38 

.35 


X  -  H04) 


.40 

.29 

.23 


M/(2) 


.44 

.36 

.33 


IV (1/2) 


.42 

.38 

.35 


■  -  W{4) 


.44 

.36 

.33 


—  W  (1/2) 


.42 

.38 

.35 


4 

.40 

.42 

2 

.29 

.36 

0 

.23 

.31 

X  —  (7(1/2)  X  ~  LN (2.5) 


.25  .23  .24 

.15  .14  .15 

.09  .08  .08 


’  -  LN  (1) 


10  .37  .33  .35 

20  .31  .27  .29 

30  .24  .21  .22 


’  ~  (7(5)  X LN( 0.4) 


.44  .41  42 

.36  .38  .37 

.33  .35  .34 


'  ~  LN( 2.5) 


.44  .23  .33 

.36  .14  25 

33 _  08 _ .20 _ 

*Sce  Section  3  for  explanation  of  table  entries  and  discussion  of  these  results 


LN(0. 4) 


.41 


1 


10 

.39 

.40 

.39 

20 

.31 

.29 

.30 

30 

.30 

.23 

.26 

n 

X -  (70/2) 

LN(. 04) 

X-  (70/2) 

X~  LN(\) 

.25 

.41 

.33 

.25 

.33 

.29 

.15 

.38 

.27 

.15 

.27 

.21 

.09 

.35 

.22 

.09 

.21 

.15 

L. 
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TABLE  4  —  Total  Error  Rates  for  Gamma  vs  Weibullfor 
Likelihood  Ratio  Procedure  of  Bain  and  Ennelliardt 


a 

n  (3: 

.5 

2 

4 

.5 

10 

.435 

.405 

.385 

20 

.375 

.345 

.320 

2 

10 

.470 

.440 

.420 

20 

.415 

.385 

.360 

Also,  Bain  and  Engelhardt  [2]  give  in  their  Table  2  some  probabilities  of  correct  selection 
between  gamma  and  Weibull  distributions  using  a  likelihood  ratio  test  statistic.  Their  results 
can  be  used  to  construct  total  error  rates  comparable  to  those  of  Table  3,  for  a  few  selected 
values  of  the  gamma  and  Weibull  shape  parameters.  We  have  computed  these  values  and  give 
them  in  Table  4.  Comparison  of  the  total  error  rates  of  Tables  3  and  4  shows  no  trend  in  favor 
of  either  procedure. 

Table  5  gives  the  selection  rates  in  our  simulation  study  for  the  three-way  scale-shape 
invariant  selection  procedure.  The  entries  in  in  Table  5  are  read  as  follows,  using  the  first  set 
of  results  as  an  example.  The  selection  is  to  be  made  among  the  gamma,  Weibull  and  lognor¬ 
mal  distributions.  One  thousand  samples  of  size  10  were  generated  from  a  (7(1,  1/2)  distribu¬ 
tion,  of  which  57  percent  are  classified  as  G,  21  percent  are  classified  as  W  and  23  percent  are 
classified  LN  The  results  in  Table  5  can  be  compared  with  the  results  in  Table  5  of  KQ.  The 
comparisons  do  not  show  that  either  of  these  procedures  has  a  clear  advantage,  however,  the 
selection  procedure  of  KQ  may  have  a  slight  edge.  Thus,  as  for  the  two-way  selection  pro¬ 
cedures  above,  we  favor  the  computationally  simpler  scale  invariant,  SI,  procedure  of  KQ. 

TABLE  5  —  Selection  Rates  for  Tliree-Way  Procedure 


. 

G 

W 

LN 

G 

io 

.57 

.21 

.23 

.20  A 

20 

.62 

.26 

.12 

.28  A 

:  30 

66 

.29 

.05 

.37  A 
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4.  SELECTION  WITH  CENSORING 


Suppose  that  from  a  random  sample  of  size  non  a  parent  random  variable  with  density 
and  distribution  functions  /  and  /,  respectively,  only  the  values  less  than  a  prespecified  time  T 
are  observed  If  r  is  the  number  of  values  less  than  T,  then  r  is  a  binomial  r\  with  probability 

function  b(r;  F(T).  n).  Let  x\ . xr  be  the  observed  values,  indexed  in  the  same  order  as 

the  original  sample,  and  ah,,  ,  a,,,  be  the  corresponding  order  statistics.  We  require  selec¬ 
tion  procedures  based  on  the  values  X\,  ....  x, ,  and  r.  We  have  studied  procedures  based  on 
scale-shape  invariance,  as  considered  above  for  complete  samples,  scale  invariance  as  in  KQ, 
and  maximum  likelihood  ratio  procedures.  Of  these  procedures,  only  the  SI  and  maximum 
likelihood,  ML ,  procedures  will  be  described  now,  since  these  procedures  will  be  recommended 
for  reasons  given  below. 


When  /  and  F  are  functions  of  parameters  W,  say  f„  and  /#,  the  likelihood  function 
corresponding  to  ,V| . x,  and  r  is 

M-Vid . *(r).  r\  T)  = 

(4.1) 


{»!/<»  -  r)!)U  -  ^(n}"-'n/A(a>/<0.n<*(o)- 

i-t 


The  scale  invariant  selection  statistic  is  defined  by 

(4.2)  S  =  J"o  L  (AA(d . AA(r),  r\  kT)k'~'dk, 


where  the  scale  parameter  in  L  has  been  set  equal  to  one,  and  S  depends  upon  a  shape  parame¬ 
ter.  The  shape  parameter  in  S  for  each  of  the  three  families  considered  here  will  be  replaced  by 
its  maximum  likelihood  estimator,  obtained  by  maximizing  the  likelihood  in  (4.1). 

We  also  consider  selection  for  the  censored  case  using,  essentially,  a  likelihood  ratio  pro¬ 
cedure  In  this  approach  we  use  the  maximum  value  of  the  likelihood  function  in  (4.1)  as  the 
selection  statistic.  Formally,  the  selection  statistic  is 

5  =  sup  {/.#(a(|, . x(f),  r\  D), 

H 

(4.3) 

”  L^{x( |),  ...  ,  A(,,f  r,  7"), 


for  »  the  ML  estimator(s)  of  0. 

The  selection  functions  for  these  two  methods  for  the  three  families  arc  given  in  Table  6. 

We  have  written  programs  to  evaluate  the  selection  statistics  of  Table  6.  A  brief  descrip¬ 
tion  of  this  work  follows  in  the  remainder  of  this  section.  For  more  detail  see  Siswadi  [231. 

Maximum  likelihood  estimates  for  the  scale  and  shape  parameters  are  required  for  the 
ML  selection  functions,  and  for  the  shape  parameter  for  the  SI  selection  functions.  For  the 
Weibull  class,  these  estimates  were  obtained  as  solutions  of  the  ML  equations  in  Cohen  |7). 
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TABLE  6  —  Selection  Statistics  for  Censored  Samples 


Note:  r(a.  b)  **  f°  exp  (-s)ds,  <b  is  AtO. 1)  df. 

0 


For  the  lognormal  class  solutions  for  the  ML  equations  were  obtained  using  results  of  Harter 
and  Moore  [17],  adjusted  for  type  l  censoring.  Another  procedure  for  lognormal  type  l  cen¬ 
sored  samples  is  given  by  Aitchison  and  Brown  [1].  Solutions  of  the  ML  equations  for  the 
gamma  class  were  obtained  from  the  results  of  Harter  and  Moore  [16],  adjusted  for  type  I  cen¬ 
soring.  After  the  ML  estimates  of  the  scale  and  shape  parameters  are  obtained,  the  evaluations 
of  the  ML  selection  functions  of  Table  6  are  straightforward. 
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After  the  ML  estimate  /3  of  the  shape  parameter  of  the  Weibull  distribution  is  obtained, 
the  SI  selection  function  is  readily  evaluated.  However,  the  selection  functions  for  both  the 
lognormal  and  gamma  scale  invariant  procedures  are  difficult  to  evaluate,  and  we  have  used 
Monte  Carlo  and  importance  sampling  from  the  normal  distribution  and  gamma  distribution, 
respectively  (see  Davis  and  Rabinowitz  [9])  to  evaluate  them. 

5.  SIMULATION  RESULTS  FOR  CENSORED  SAMPLES 

We  have  conducted  a  small  Monte  Carlo  simulation  study  of  the  two  selection  methods 
discussed  above  for  censored  samples  to  provide  some  information  on  the  error  rates  for  these 
procedures.  These  empirical  error  rates  allow  comparisons  of  the  two  procedures  with  each 
other  as  well  as  with  complete  sample  rates  given  in  Section  3  and  in  KQ.  Comparison  with 
complete  sample  rates  gives  a  measure  of  the  loss  of  information  due  to  censoring. 

The  families  of  distributions  considered  were  If' (1/2) ,  R/(4),  G(l/2),  G(2),  LN( 0.4), 
and  LN (2.5);  and  the  sample  size  was  n  =  30  in  all  cases.  The  truncation  point  T  was  chosen 
so  that  the  df,  F(T)  —  0.90,  i.e.,  for  a  mean  rate  of  10%  censoring.  One  hundred  samples 
were  generated  for  each  of  the  above  distributions  except  IV  (4),  for  which  16  samples  were 
generated.  The  running  time  for  some  cases  was  very  long  and  this  limited  the  nnmber  of  sam¬ 
ples  that  could  be  generated. 

The  misclassification  rates  for  pairwise  selection  are  given  in  Table  7,  which  is  comparable 
to  Table  3  for  the  complete  samples  case.  Note  in  Table  7  that  the  ML  and  SI  procedures  give 
similar  error  rates  for  W  vs  LN  and,  in  fact,  both  give  the  same  total  error  rate  of  0.29.  Com¬ 
parison  of  these  results  with  those  of  the  SI  procedure  for  complete  samples  (see  Table  4  of 
KQ)  shows  that  there  is  a  rather  large  loss  of  information  due  to  censoring  since  the  IT,  LN 
and  total  error  rates  are  0.19,  0.15,  and  0.17,  respectively. 


TABLE  7  —  Misclassification  Rates  for  Pairwise  Selection 
Procedures-Censored  Sample  (n  =  30) 


Procedure 

X  -  W 

X  -  LN 

Total 

ML 

.40 

.18 

.29 

SI 

.45 

.12 

.29 

Procedure 

X  -  G  ( 1/2) 

X~  W{\/2) 

Total 

X  ~  G  ( 1/2) 

X  -  IV A) 

Total 

.38 

.32 

.35 

.38 

E| 

H 

.52 

.16 

.34 

.52 

MSI 

X  ~  <7(2) 

X  -  W(  1/2) 

X  -  G( 2) 

X  -  IV(4) 

ML 

.36 

.32 

.34 

.36 

.44 

.40 

SL 

.39 

.16 

.28 

.39 

.50 

.45 

X  G (1/2) 

X  -  LNI0A) 

X  ~  G  ( 1/2) 

X  -  LNI2.5) 

ML 

.14 

.32 

.23 

.14 

.08 

.11 

SI 

,20 

.31 

.26 

.20 

.09 

.15 

X  ~  G  (2 ) 

X  ~  LN(0A) 

X  -  G (2) 

X  -  LN (2.5) 

ML 

.35 

.32 

.34 

.35 

.08 

.22 

SI 

.42 

,3i 

.37 

.42 

.09 

.21 

VOL.  29.  NO  4.  DECEMBER  1982 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


•  • 


I  • 


I  • 


»  • 


I  *#• ; 


9 


9 


9 


566 


SISWaDI  and  c  p  QUESENBERRY 


For  the  two-way  selection  error  rates  in  Table  7  that  involve  a  gamma  distribution,  neither 
the  ML  nor  the  SI  procedure  appears  to  have  an  overall  advantage.  Also,  by  comparing  these 
cases  with  the  same  cases  in  Table  4  of  KQ,  we  feel  that  the  loss  of  information  due  to  censor¬ 
ing  is  not  so  large  as  for  the  W  vs  LN  case  commented  on  above. 

The  classification  rates  for  three-way  selection  procedures  are  given  in  Table  8.  Again, 
neither  the  ML  nor  the  SI  procedure  appear  to  have  any  overall  advantage,  and  both  perform 
quite  well.  Also,  by  comparison  with  Table  5  of  KQ  it  appears  that  ten  percent  censoring  has 
little  effect  on  the  probability  of  correctly  classifying  a  lognormal  sample,  but  the  probabilities 
of  correctly  classifying  either  Weibull  or  gamma  samples  are  reduced  somewhat. 


TABLE  8  —  Classification  Rates  for  Three-Way 
Procedures— Censored  Sample  (tt  —  30) 


Procedure 

X  -  (7(1/2) 

X 

~  (7(2) 

X  ~~ 

W(  1/2) 

G 

W 

LN 

G 

W 

LN 

G 

w 

LN 

ML 

.60 

.29 

.11 

.29 

.36 

.35 

.27 

.35 

38 

SI 

.47 

.35 

.18 

.28 

.32 

.40 

.16 

.40 

.44 

X  -  W( 4) 

X 

X  ~ 

LA  (2.5) 

ML 

.13 

.56 

.31 

.23 

.09 

.68 

.01 

SI 

.82 

SI 

.25 

.50 

.25 

.23 

.08 

.69 

.01 

mm 

.86 

6.  A  USER  PROGRAM  AND  EXAMPLES  FOR  CENSORED  SAMPLES 

The  selection  procedures  for  the  three  families  of  distributions  have  been  proprammed  in 
FORTRAN.  A  listing  of  this  program  can  be  obtained  from  the  authors.  The  program  com¬ 
putes  the  selection  statistics  for  complete  and  censored  samples  according  to  the  formulas  gi'  en 
in  Tables  2  and  6,  respectively. 

For  the  scale  invariant  procedure,  the  selection  statistics  are  computed  by  the  Monte 
Carlo  method  given  in  Davis  and  Rabinowitz  [9],  The  program  was  tested  on  several  examples 
and  on  many  samples  produced  through  simulation.  In  general,  the  selection  statistics 
estimated  did  not  appear  reliable  for  heavily  censored  samples.  Therefore,  in  the  user  program 
for  the  scale  invariant  procedure,  the  selection  results  are  not  printed  if  the  coefficient  of  varia¬ 
tion  of  the  replicated  values  in  the  Monte  Carlo  method  is  larger  than  35%. 

EXAMPLE  1:  Birnbaum  and  Saunders  [5]  considered  a  set  of  data  of  lifetimes,  in 
thousands  of  cycles,  of  aluminum  sheeting  under  periodic  loading,  to  illustrate  the  gamma  fam¬ 
ily.  If  we  assume  that  the  experiment  was  terminated  at  a  prespecified  time,  say  T  —  1900, 
then  the  censored  observations  and  the  results  of  the  selection  procedure  are  presented  in  Table 
9.  For  these  data,  the  Weibull  family  is  selected  by  both  the  ML  and  SI  procedures,  however, 
the  selection  statistics  for  the  gamma  family  are  only  slightly  smaller.  It  is  also  to  be  noted, 
although  the  details  are  not  given  here,  for  the  complete  sample  the  selection  procedure  based 
on  the  selection  statistics  given  in  Table  2  yields  the  same  results. 
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TABLE  9  —  Results  of  Selection  Procedure 


Lifelimes  of  Aluminum  Sheeting  under  Periodic  Loading 


370 

706 

716 

746 

785 

797 

844 

855 

858 

886 

886 

930 

960 

988 

990 

1000 

1010 

1016 

1018 

1020 

1055 

1085 

1 102 

1102 

1108 

1115 

1120 

1134 

1 140 

1199 

1200 

1200 

1203 

1222 

1235 

1238 

1252 

1258 

1262 

1269 

1270 

1290 

1293 

1300 

1310 

1313 

1315 

1330 

1355 

1390 

1416 

1419 

1420 

1420 

1450 

1452 

1475 

1478 

1481 

1485 

1502 

1505 

1513 

1522 

1522 

1530 

1540 

1560 

1567 

1578 

1594 

1602 

1604 

1608 

1630 

1642 

1674 

1730 

1750 

1750 

1763 

1768 

1781 

1782 

1792 

1895 

1820 

1868 

1881 

1890 

1893 

Sample  Size  =  101 
Sample  Observed  =  91 
Truncation  point 


Maximum  Likelihood  Estimates 


Family  Scale  Shape 


Weibull  0.154149D  +  04  0.4041 14D 

Gamma  0.125214D  +  03  0.1 12550D 

Lognormal  0.135159D  +  04  0.317034D 


0.4041 14D  +  01 
0.1 12550D  +  02 
0.317034D  +  00 


Family  Selection  Statistic 


Maximum  Likelihood  Procedure 


Weibull  0. 147I64D  +  03 

Gamma  0. 146470D  +  03 

Lognormal  0. 144092D  +  03 


The  Family  Selected  is  Weibull 


Scale  Invariant  Procedure 


Weibull 

0. 144432D  +  03 

Gamma 

0.144103D  +  03 

Lognormal 

0.141762D  +  03 

The  Family  Selected  is  Weibull 
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EXAMPLE  2:  Bartholomew  [4,  p.  370]  gave  the  failure  times  of  15  items  that  failed  dur¬ 
ing  a  specified  period  of  testing  from  an  original  sample  of  size  n  =  20.  He  states  that  the 
items  have  an  exponential  life  distribution,  and  uses  the  exponential  distribution  to  perform 
analyses  of  the  data.  We  have  used  these  data  in  the  selection  program,  and  the  results  are 
given  in  Table  10.  Both  the  maximum  likelihood  and  scale  invariant  procedures  prefer  the  log¬ 
normal  distributions,  which  casts  some  doubt  on  the  assumption  of  an  exponential  parent  distri¬ 
bution. 


TABLE  10  —  Results  of  Selection  Procedure 


Bartholomew  Data 

3 

19 

23 

26 

27 

37 

38 

41 

45 

58 

84 

90 

99 

109 

138 

Samp 

e  size 

=  20 

Sample  Observed 

=  15 

Truncation 

point 

=  150 

Maximum  Likelihood  Estimates 

Family 

Scale 

Shape 

Weibull 

0. 105498D  03 

0. 108289D  01 

Gamma 

0  876146D  02 

0. 1 16892D  01 

Lognormal 

0.6825 17D  02 

0.122585D  01 

Family  Selection  Statistic 
Maximum  Likelihood  Procedures 

Weibull  -0.669185D  01 

Gamma  -0.66425 ID  01 

Lognormal  — 0.660273D  01 

The  Family  Selected  is  Lognormal 
Scale  Invariant  Procedure 

Weibull  -0.720101  D  01 

Gamma  — 0.699896D  01 

Lognormal  -0.667347D  01 

The  Family  Selected  is  Lognormal 
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ABSTRACT 

This  paper  is  a  case  study.  We  show  how  the  powerful  methods  of  time 
series  analysis  can  be  used  to  investigate  the  interrelationships  between  Alert 
Availability,  a  logistics  performance  variable,  and  Flying  Hours,  an  operational 
requirement,  in  the  presence  of  a  major  change  in  operating  procedures  and  us¬ 
ing  contaminated  data.  The  system  considered  is  the  fleet  of  C- 1 4 1  aircraft  of 
the  U.S.  Air  Force.  The  major  change  in  operating  procedures  was  brought 
about  by  what  is  known  as  Reliability  Centered  Maintenance,  and  the  contam¬ 
inated  data  were  due  to  anomalies  in  reporting  procedures.  The  technique  used 
is  a  combination  of  transfer  function  modeling  and  intervention  analysis. 


1.  INTRODUCTION  AND  SUMMARY 

In  January  1976,  the  U.S.  Air  Force  began  some  experimental  modifications  to  the  exist¬ 
ing  maintenance  policies  for  the  fleet  of 

C-141  aircraft.  These  modifications  were  a  part  of  a  Department  of  Defense  project  known  as 
'Reliability  Centered  Maintenance,'’  henceforth  denoted  by  RCM.  The  modifications  involved 
an  extension  of  the  maintenance  intervals  and  a  reduction  in  the  amount  of  scheduled  mainte¬ 
nance.  The  experimental  phase  of  the  project  ended  in  June  1977,  and  the  modified  policies 
were  officially  and  permanently  institued  at  that  time.  The  anticipated  benefit  from  RCM  was  a 
decrease  in  scheduled  maintenance  activity,  with  a  consequent  increase  in  "alert  availability." 
Alert  availability,  henceforth  denoted  by  AA,  is  the  instantaneous  probability  that  a  typical  air¬ 
craft  is  available  to  react  to  an  execution  order.  In  practice,  its  average  value,  over  say  a 
month,  is  computed  by  dividing  the  monthly  total  number  of  "fleet  operational  hours"  by  the 
total  number  of  "fleet  available  hours."t  A  plot  of  AA  from  October  1973  through  November 
1979  is  shown  in  Figure  1. 

•This  research  was  iniliated  and  supported  by  Air  Force  Logistics  Management  Center.  Contract  FOI600-79-D0146. 
Office  of  Naval  Research  Contract  N00014-77-C-0263  and  U.S.  Army  Research  Office  Grant  DA  AG-29-80-C-0067. 

••Major,  USAF 

t  Fleet  available  hours  is  the  cumulative  time  of  possession  across  a  fleet  of  aircraft,  whereas  fleet  operational  hours  is  the 
cumulative  amount  of  operational  time  (i.e..  up-time)  for  the  fleet. 
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Figure  1.  A  plot  of  C-141  alert  availably  from  October  1973  to  November  1979 

In  a  previous  study  (Singpurwalla  and  Talbott  15]),  we  investigated  the  effects  of  RCM  on 
several  variables  which  describe  what  is  known  as  the  "logistics  performance"  of  the  fleet.  Our 
conclusion  was  that  there  was  no  evidence  of  an  improvement  in  the  logistics  performance  of 
the  fleet  due  to  RCM;  on  the  contrary,  in  some  cases  there  was  a  clear  indication  of 
deterioration  in  performance.  These  conclusions  were  particularly  true  of  AA,  which  is  con¬ 
sidered  to  be  an  important  logistics  performance  variable.  One  criticism  of  this  previous  study, 
and  a  valid  or.3,  is  that  it  did  not  take  into  consideration  the  influence  of  other  "operational 
variables,"  which  in  addition  to  RCM  may  affect  the  logistics  performance  variables.  An  impor¬ 
tant  operational  variable,  which  is  suspected  of  being  strongly  related  to  the  AA,  is  "flying 
hours."  Flying  hours  is  the  total  number  of  hours  flown  by  the  fleet  of  the  C-141’sover  a  cer¬ 
tain  period  of  time,  say  one  month.  A  plot  of  the  monthly  flying  hours,  from  October  1973 
through  November  1979,  is  shown  in  Figure  2.  Another  criticism  of  our  previous  study,  and 
again  a  valid  one,  is  that  it  did  not  adequately  account  for  the  fact  that  some  of  the  AA  data 
were  "messy."  Specifically,  there  were  some  revisions  to  the  information  system  for  reporting 
the  operational  status  of  the  aircraft  that  resulted  in  some  possible  anomalies  in  the  reported 
values  of  the  AA. 

The  analysis  that  is  described  here  was  initiated  with  a  view  towards  rectifying  the  limita¬ 
tions  of  our  previous  study.  Here,  using  the  AA  and  the  flying  hours  as  examples,  we  demon¬ 
strate  how  a  procedure  for  investigating  the  interrelationships  between  the  two,  using  messy 
data  and  RCM  as  an  intervention,  can  be  developed  and  used.  The  approach  we  take  is  a  com¬ 
bination  of  those  described  in  Box  and  Jenkins  [1],  pp.  335-420)  for  the  analysis  of  multiple 
time  series,  and  in  Box  and  Tiao  [2]  for  intervention  analysis.  Our  conclusion  is  that  the  AA  is 
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Fi<ii  Rfc  2  A  plot  of C-141  flying  hours  from  October  1973  to  November  1979 


indeed  related  to  the  flying  hours,  as  has  been  conjectured,  but  that  even  after  taking  this  rela¬ 
tionship  into  consideration,  and  in  the  presence  of  messy  data,  our  previous  conclusion  still 
holds,  namely,  that  there  is  no  clear  evidence  of  improvement  in  the  AA  after  the  initiation  of 
RCM. 

Before  going  into  the  details  of  our  analysis,  we  want  to  emphasize  that  the  above  conclu¬ 
sion  and  its  practical  implications  are  not  intended  to  be  the  main  theme  of  this  paper.  Rather, 
our  aim  is  to  suggest  and  to  demonstrate  how  the  powerful  methods  of  time  series  analysis  *an 
be  used  to  analyze  the  messy  and  interrelated  data  that  often  arise  in  a  study  of  the  reliability 
and  the  logistics  performance  of  large  military  systems. 

In  what  follows,  we  presume  that  the  reader  has  a  knowledge  of  autoregressive  integrated 
moving  average  (ARIMA)  processes  and  is  familiar  with  the  notation,  terminology,  and  metho¬ 
dology  described  in  Box  and  Jenkins  [1]. 

2.  NOTATION,  PRELIMINARIES,  AND  AN  OUTLINE  OF  THE  PROCEDURE 

Let  X  and  Y  be  two  variables  of  interest,  and  let  X,  and  Y,  be  their  values  at  time  t.  In 
our  case,  we  let  X,  denote  the  total  flying  hours  for  the  fleet  of  C-I41's  during  the  rth  month, 
and  Y,  the  alert  availability  during  that  month;  t  varies  from  October  1973  through  November 
1979.  A  sequence  of  values  T,,T,_|,  . . .  will  be  denoted  by  |T,}. 

To  discern  the  relationship  between  X,  and  K,,  we  strive  to  obtain  a  linear  transfer  function 
model  (Box  and  Jenkins  (1 1,  p.  379)  of  the  form 

(2.1)  (I  -  6,0  -  ...  -  fi ,Br)  Y,  -  (h-o  -  wtB  -  ...  -  w%Bs) X,_b  +  N,, 
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where  B"'X,  «  .V,  ,,, .  m  »  0  I .  and  6, . S,.» .  «  are  unknown  constants  to  he 

estimated,  ft  represents  the  lag  of  with  respect  to  V ,  and  .V.  denotes  a  noise  component 
described  by  a  suiu.b!c  .aRIMA  process. 

Equation  (2  11  describes  the  alert  availability  at  time  t  in  terms  of  the  previous  values  of 
the  alert  availability  and  the  present  and  previous  values  of  the  flying  hours 

Following  the  procedures  suggested  by  Box  and  Jenkins  [1]  and  by  Haugh  and  Box  (3), 
we  first  consider  the  sequence  (series)  ( A', }  and  transform  it  to  a  "while  noise  series,"  {a,),  by 
way  of  an  appropriate  univariate  time  series  model.  Similarly,  we  also  reduce  the  sequence  {  Y,} 
to  a  white  noise  series,  {ft,),  using  an  appropriate  "intervention  analysis  model"  (see  Box  and 
Tiao  [2]).  Such  a  model  is  necessary  here  in  order  to  account  for  RCM  as  well  as  for  the  pres¬ 
ence  of  some  anomalous  observations  in  { K,|.  This  procedure  of  reducing  {Y,)  and  { F,|  to  {a,} 
and  {ft,),  respectively,  is  known  as  prewhitening  the  respective  series.  We  then  cross  correlate 
the  two  prewhitened  sequences  [a,]  and  { ft, )  in  order  to  obtain  an  indication  of  the  relationship 
between  (T,)  and  I  K,).  The  cross  correlation  function  is  used  to  suggest  values  for  r,s,  and  ft  in 
(2.1).  Finally,  we  use  the  prewhitened  sequences  {a,!  and  (ft,)  to  estimate  the  constants 
8 1 . 8r,  w0  ... .  ws  for  some  selected  values  for  r.  s.  and  ft. 


Regarding  the  prewhitening  of  { >',)  using  a  model  for  intervention  analysis,  we  remark 
that  intervention  c  te  to  RCV1  can  be  described  bv  a  sequence  of  indicator  variables,  say  (Z,). 
where  Z,  takes  a  value  of  0  for  all  i  representing  the  months  prior  to  January  1976.  and  a  value 
1  for  all  t  thereafter.  Recall  that  January  1976  is  the  date  of  intervention  — the  date  at  which 
experimental  RCM  was  initiated.  The  response  of  the  sequence  {  F,).  the  output  sequence  denot¬ 
ing  A  A,  to  the  input  sequence  { Z, }  can  take  various  functional  forms.  These  are  depicted  in  Fig¬ 
ure  3;  they  have  been  taken  from  Box  and  Tiao  (2).  We  strive  to  use  the  most  appropriate 
form  of  the  response  function  for  the  situations  at  hand. 

3.  PREWHITENING  THE  FLYING  HOURS  SERIES  {.V,| 

An  inspection  of  Figure  2  finds  that  the  fluctuations  of  the  series  about  its  mean  changes 
over  time.  In  such  situations  it  is  common  to  lake  the  natural  logarithms  of  the  data  Accord¬ 
ingly,  our  analysis  of  flying  hours  involves  a  logarithmic  transformation  of  the  original  data. 

We  find  that  flying  hours  can  best  be  described  by  an  ARIMA  (2,0,0)  *  (2,0,0)4  process. 

However,  the  residuals  from  this  model  reveal  significant  autocorrelations  at  lags  5,  10,  15 . 

etc.  Consequently,  we  fit  an  ARIMA  (0,0,0)  *  (0,0, 1  )5  model  to  these  residuals  Based  upon 
these  considerations,  an  appropriate  model  for  prewhitening  can  be  written  as 

(3.1)  (1  -  .54 B  +  llfl2)U  +  .38 B*  +  .37 £»)<*,  -  10.09) 

=  (I  +  ASB^a,. 

This  model  produces  residuals  whose  autocorrelation  function  (ACF)  and  log  spectral  density 
are  shown  in  Figure  4.  in  (a)  and  (b).  respectively.  These  residuals  are  listed  in  Table  I. 

4.  PREWHITENING  THE  ALERT  AVAILABILITY  SERIES  |  )',) 

In  our  previous  study  we  considered  alert  availability  as  two  separate  series,  one  ending 
prior  to  tanuary  1976  and  the  other  beginning  with  January  1976  We  found  that  the  loga- 
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b.  Slope  Response 
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d.  Step  Response 
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Figure  3.  Typical  forms  of  (he  response  {  Y, j  to  ihe  input  I Z,) 
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(b)  Graph  of  log  speclral  density 


Figure  4.  Behavior  of  the  ACF  and  the  log  of  the  speclral  density  of  the  residuals  from  an 
ARIMA  (0,0,0)  •  0.0, 1  >5  model  fit  to  residuals  from  an  ARIMA  (2.0.0)  •  2.0.0)4  model 
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TABLE  1  —  Residuals  from  Prewhitening  Models 
for  Flying  Hours  and  Alert  A  vailability 


TIME  T 

A  (T) 

B  (T) 

TIME  T 

A  (T) 

B  (T) 

JAN  1975 

-.131303 

JUL  1977 

.037152 

-.053020 

FEB  1975 

-.026086 

-.074950 

AUG  1977 

.022717 

.012580 

MAR  1975 

-.034914 

-.003150 

SEP  1977 

.021623 

-.045700 

APR  1975 

.283323 

.108000 

OCT  1977 

.080163 

.053860 

MAY  1975 

.060148 

.026820 

NOV  1977 

.037052 

.039220 

JUN  1975 

-.032649 

.056160 

DEC  1977 

—.048706 

-.063430 

JUL  1975 

.135207 

.085220 

JAN  1978 

-.047455 

-.011080 

AUG  1975 

.124353 

.105500 

FEB  1978 

-.058318 

.016660 

SEP  1975 

-.007629 

.039270 

MAR  1978 

.130172 

-.010630 

OCT  1975 

.059647 

-.003746 

APR  1978 

-.053118 

.013570 

NOV  1975 

.046232 

.035070 

MAY  1978 

-.008318 

.001221 

DEC  1975 

.020893 

.007938 

JUN  1978 

.020092 

-.023900 

JAN  1976 

-.025669 

.033680 

JUL  1978 

.036141 

.067210 

FEB  1976 

-.021861 

.054060 

AUG  1978 

-.073053 

-.013030 

MAR  1976 

.109077 

.095080 

SEP  1978 

-.002367 

-.015090 

APR  1976 

-.101662 

-.033110 

OCT  1978 

-.050561 

-.099890 

MAY  1976 

.001382 

-.081130 

NOV  1978 

-.054985 

.001532 

JUN  1976 

-.044392 

-.057940 

DEC  1978 

-.110742 

.036330 

JUL  1976 

-.012095 

.068650 

JAN  1979 

.041226 

.051630 

AUG  1976 

.068336 

-.175700 

FEB  1979 

-.043154 

-.043154 

SEP  1976 

-.047820 

.075940 

MAR  1979 

.106237 

.047260 

OCT  1976 

-.035721 

-.047270 

APR  1979 

-.173297 

-.027790 

NOV  1976 

-.066278 

-.066570 

MAY  1979 

.007886 

-.090300 

DEC  1976 

-.044366 

-.042450 

JUN  1979 

-.031905 

.030150 

JAN  1977 

.003072 

-.040260 

JUL  1979 

-.014306 

-.183300 

FEB  1977 

-.073529 

.076220 

AUG  1979 

-.027671 

.163600 

MAR  1977 

.045873 

-.014810 

SEP  1979 

-.030462 

.029880 

APR  1977 

.097139 

.062520 

OCT  1979 

.026290 

-.062220 

MAY  1977 

JUN  1977 

.008354 

-.089749 

.098570 

.137600 

NOV  1979 

-.070995 

-.015390 
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rithms  of  the  observations  in  both  these  series  could  best  be  described  by  an  AR1MA  (1,0,0)  * 
(1,1,0)6  model.  Here,  we  prewhiten  the  entire  series  by  considering  the  onset  f'f  RCM  as  an 
intervention  and  incorporating  this  effect  into  the  AR1MA  (1,0,0)  *  (1,1 ,0)6  model. 


Figure  1  indicates  clearly  that  the  AA  had  been  decreasing  after  January  1976,  the  onset 
of  RCM.  However,  the  rate  of  decrease  of  the  AA  during  the  experimental  phase  of  RCM  is 
greater  than  the  rate  of  decrease  during  the  post-RCM  phase.  This  suggests  that  the  effect  of 
RCM  during  the  experimental  phase  may  be  different  from  the  effect  after  the  experimental 
phase.  One  possible  reason  for  this  difference  is  that  there  was  a  piecemeal  introduction  of 
RCM,  air  base  by  air  base,  during  the  experimental  phase,  along  with  several  trial  revisions  in 
'he  maintenance  policies.  We  also  notice  some  large  spikes  in  value  of  the  AA  during  October, 
November,  and  December  1977  and  January  1978.  A  cause  for  these  large  values  may  be  that 
the  information  system  for  reporting  the  operational  status  of  aircraft  was  revised  during  this 
period.  The  AA  may  have  been  artificially  increased  during  this  period  due  to  anomalies  of 
reporting.  Thus,  it  appears  that  we  need  three  distinct  components  to  our  intervention  analysis 
model.  We  represent  these  components  by  three  indicator  variables,  J,,K,,  and  Z.,,  where: 


1 


0 


for  Jan  76  <  /  <  May  77 
otherwise 


K, 


1  for  Jun  77  <  t  <  Sep  77 
1  for  Feb  78  <  t  ^  Nov  79 
0  otherwise 


1 


L,= 


0 


for  Oct  77  <  t  <  Jan  78 
otherwise 


Note  that  the  union  of  the  three  sequences  {/,),  {/f,},  and  {£,)  constitutes  the  series  { Z,}, 
defined  earlier. 


As  for  the  functional  form  of  the  response  (series  (T,})  to  these  components  (see  Figure 
3),  we  remark  that  ramp  and  step  responses  appear  to  be  possible  candidate  forms  for  (/,}  and 
whereas  a  jump  response  is  appropriate  for  { L We  exclude  a  jump  response  for  the  first 
two  because  it  would  not  account  for  the  gradual  decline  evident  in  the  data,  and  we  c  ^sider  a 
slope  response  to  be  inappropriate  due  to  the  absence  of  a  sustained  leveling  of  the  urna.  We 
conjecture  that  a  step  response  might  have  a  period  of  three  months,  which  coincides  with  the 
length  of  time  between  scheduled  minor  inspections.  Our  aim  now  is  to  compare  the  functional 
forms  of  the  ramp  response  versus  the  step  response  in  order  to  determine  which  combination 
might  best  represent  the  relationship  between  AA  and  the  series  (Z,). 


We  fit  all  reasonable  combinations  of  response  forms  with  the  variables  K,,  and  Z,,, 
and  find  that  step  type  responses,  along  with  the  original  AR1MA  model,  i.e.,  a  model  of  the 
form 

(41)  Y‘  -  ttV7'  +  rrV*' +  aiL< + 
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where  N,  denotes  an  ARIMA  (1,0,0)  •  (1,1, 0)6  process,  produces  residuals  that  are  most 
satisfactory  (see  Figure  5).  The  model  (4.1)  describes  the  series  {F,}  as  a  step  function  with 
three-month  incremental  decreases  of  size  <*|  during  the  experimental  RCM  phase,  as  a  step 
function  with  three-month  incremental  decreases  of  size  a2  after  implementation  of  RCM,  and 
as  a  jump  function  with  height  a3  during  the  October  1977-January  1978  time  frame.  Thus  we 
have  the  prewhitening  model 

(4.2)  (1  -  .185)0  +  .3556) (1  -  Bb)Y,  = 

+  .  1 5  Lf  +  bt. 


I  • 


ft  • 


We  remark  that  the  estimated  parameters  a!  and  a2  are  negative,  as  expected,  with  a{  greater 
than  a2.  Also,  the  estimated  parameter  a3  is  positive,  which  is  intuitively  apparent.  We  also 
remark  that  in  the  residual  series  (see  Figure  5)  there  remain  some  significant  auto-correlations 
at  lag  12.  We  can  account  for  this  by  fitting  an  ARIMA  (0,0,0)  *(2,0,0)12  model  to  the  residu¬ 
als.  However,  this  would  provide  us  with  only  34  observations  with  which  to  perform  a  cross 
correlation  analysis.  Since  paucity  of  data  can  lead  us  to  questionable  results,*  we  elect  to  con¬ 
tinue  working  with  the  residuals  of  the  model  (4.2).  These  residuals  are  given  in  Table  1  as 
values  of  bt\  also  given  there  are  values  of  a,y  the  prewhitened  flying  hours. 


ft  *• 
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Figure  5  Intervention  model  for  alert  availability,  graph  of  the  ACF  of  the  residual  series 


'Actually,  we  did  perform  a  cross  correlation  analysis  with  only  34  observations  and  obtained  results  that  were 
counter-intuitive 
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5.  CROSS  CORRELATION  ANALYSIS 

Using  the  prewhitened  series  ( a ,}  and  {6,},  we  now  develop  the  transfer  function  model 
(2.1).  We  remark  that  the  { a ,)  series  with  its  59  observations  must  be  modified  by  omitting  the 
January  1975  observation  so  that  it  will  match  the  [6,1  series,  which  has  58  observations. 

Following  procedures  of  Box  and  Jenkins  and  using  the  estimated  "impulse  response 
weights,"  Vk,  shown  in  Figure  6,  we  remark  that  possible  values  of  (r,s,b)  in  (2.1)  are  (1,3,0) 
or  (2,3,0).  From  the  autocorrelation  functions  of  the  generated  noise  series  (see  Singpurwalla 
and  Talbott  [4]),  we  remark  that  a  possible  model  for  the  noise  N,  is  an  ARIMA  (1.0,1). 


K 

V(K) 

0 

0.224 

1 

0.260 

2 

0.035 

3 

0.126 

4 

0.026 

S 

0.000 

6 

0.078 

7 

0.127 

8 

-0.025 

9 

0.023 

10 

-0.016 

11 

0.149 

12 

-0.227 

13 

-0.213 

14 

-0.021 

15 

0.146 

Figure  6.  Estimated  impulse  response  weights 
ViK)  from  a  cross  correlation  of  prewhitened 
flying  hours  and  alert  availability 


We  fit  both  the  (2,3,0)  and  the  (1,3,0)  transfer  function  models  with  an  ARIMA  (1,0,1) 
noise  component  to  the  data  and  find  that  the  more  parsimonious  (1,3,0)  model  results  in 
better  residuals.  This  estimated  model,  whose  residuals  have  an  ACF  as  shown  in  Figure  7,  is 

(5.1)  (1  +  .5 3B)b,  =  (.25  +  .51/9  +  .24 B2  +  .1  B})a,  +  j  -  -j  lf  e,, 

1  *r  A2a 

where  e,  is  random  noise. 

6.  THE  COMBINED  INTERVENTION  ANALYSIS  TRANSFER  FUNCTION  MODEL 

We  can  expand  the  transfer  function  model  (5.1)  for  the  prewhitened  series  (a,)  and  {*,) 
by  using  the  following  relationships  from  (3.1)  and  (4.2): 

^  n  (1  -  .54 B  +  ,llfl2)(l  +  .38 B*  +  .37/98) (X,) 
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Figure  7.  A  (1,3,0)  transfer  function  model  between  flying  hours  and  alert 
availability;  graph  of  the  ACF  of  the  residual  series 


b,  -  (1  -  .185)0  +  .3556Hl  -  B6)  Y,  +  ^°  2  .  J,  +  -  ^  K, 


-  .151,, 


where  X,  -  (X,  -  10.09). 


Subslituling  (6.1)  and  (6.2)  for  a,  and  b,  in  Equation  (5.1),  we  obtain  a  multiplicative  transfer 
function  model  relating  the  input  series  {A’,}  to  the  output  series  { Y,)  as 

(1  +  .1855)0  +  .535)0  -  .185)0  +  ,3556)(1  -  56)(1  +  .125)  r, 

-  (.25  +  .515  +  ,2452  +  ,153)(1  -  .545  +  ,1152)(1  +  ,3854  +  .37 58) 

■(1+  \2B)X  —  ^  -125)0  4-  .53 5)  r  on  H 

(1  +  . 1 2 B > Jf,  (|  +  J84i)(1  _  Bl) 


(1  +  .125)(1  -I-  .535) 
(1  +  ,185S)U  -  53) 


1.01  K,) 


,  (1  +  .  1 25) (1  +  .535)  r  ,f,  ,  , 

+  - (I  +  ,18S>) - '  l5t''  +,‘  "  llSlf' 

Note  that  X,  and  Y,  in  (6.3)  are  in  terms  of  the  logarithms  of  the  original  data. 

Expanding  the  polynomials  and  rewriting  Equation  (6.3)  in  conventional  notation  (in  the 
interest  of  parsimony,  coefficients  with  a  value  less  than  ,1  are  arbitrarily  deleted),  we  can 
reduce  the  transfer  function  model  to 
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n  = 

1 

4-*- 

•-4 

i 

+ 

1 

1 

oo 

_5  +  .59L,_6  +  .31K,_7  +  .12T,_1] 

+ 

•39  F,_12  +  .17  y,_I3 

(6.4) 

+ 

.25  X,  +  ,68T,_|  +  .1 1  A",_3 

+  ,26T,_5  +  ,25T,_9 

+ 

•42e,_i  +  .18e,_5 

+ 

Z,  +  .65Z,_i  ~  - 1 8 Z,_ 5  — 

•  12Z„6 

where,  following  the 

notation  of  Section  4, 

Z,  =  {[  -  .02 7,  -  .027, _3  -  .02 7,_6  -  .027, _9  -  . . .  ] 

+  [-OIK,  -  ,01K,_3-  ,01K,_6-  ,01K,_9-  ...1  +  .15L,). 

7.  CONCLUDING  REMARKS 

From  Equation  (6.4),  we  remark  that  the  nonzero  coefficients  associated  with  the  X,'s 
imply  that  flying  hours  do  have  an  effect  on  the  alert  availability  in  a  manner  specified  by  the 
functional  form  of  the  equation.  Furthermore,  an  examination  of  Figure  2,  together  with  the 
prewhitening  transformation.  Equation  (3.1),  reveals  that  there  was  no  upward  or  downward 
trend  in  the  flying  hours  during  the  period  of  study.  However,  there  does  appear  to  be  a  reduc¬ 
tion  in  the  variability  of  the  flying  hours  as  of  the  inception  of  RCM.  In  any  case,  it  appears 
that  Equation  (6.4)  supports  the  adage  that  "the  more  you  fly,  the  less  you  fail,"  within  limits. 

The  negative  coefficients  (albeit  small)  associated  with  the  variables  7,  and  K ,  in  Equation 
(6.4)  do,  in  the  absence  of  any  upward  or  downward  trend  in  flying  hours,  support  our  premise 
that  RCM  has  a  tendency  to  reduce  the  alert  availability. 

The  model.  Equation  (6.4),  can  be  used  not  only  for  interpretative  purposes  as  is  done 
above,  but  within  limits  and  with  proper  care,  it  can  also  be  used  to  predict  future  availability 
given  its  previous  values,  and  the  present  and  previous  values  of  the  flying  hours. 
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ABSTRACT 

A  theoretical  and  computational  investigation  is  made  of  the  performance 
of  a  dynamic-programming-based  algorithm  for  nonlinear  integer  problems  with 
various  types  of  constraints.  We  include  linear  constraints,  aggregated  linear 
constraints,  separable  nonlinear  constraints  and  constraints  involving  maxima 
and  minima.  Separability  of  the  objective  function  is  assumed.  The  new 
feature  of  the  algorithm  is  that  two  types  of  fathoming  or  pruning  are  used  to 
reduce  the  size  of  tables  and  number  of  computations:  fathoming  by  bounds 
and  fathoming  by  infeasibility 


1.  INTRODUCTION 

Several  lines  of  research  have  been  suggested  for  mathematical  programming  problems 
which  are  nonlinear  and  also  restricted  to  integer  values  for  the  decision  variables.  We  define  a 
very  general  notation  for  such  a  problem: 


n 


(1) 

max 

2  ”  £  4>j(Xj) 

J- 1 

subject  to 

(PI) 

(2) 

/;,( x)  <0  i  —  \m  . . 

.  ,  m 

(3) 

Xj  ^  0,  integer  J  =  1,  . . . 

,  n 

where  <£,(*,)  are  nondecreasing  functions  and  the  constraint  set  is  bounded  and  nonempty,  and 
X  =  (X|,X2 . xn). 

Applications  that  have  been  or  could  be  modeled  by  a  representation  of  the  form  (PI) 
include  multidimensional  knapsack  problems  [5],  optimal  allocation  of  scarce  resources  in  pro¬ 
duction  planning  (5],  servicing,  repair  and  replacement  of  machinery  [5],  [7],  [2],  control  of 
inventory  [181,  chemical  reaction  in  a  stirred  tank  [1],  bottleneck  problems  in  ecomomics  [3], 
[I],  [51,  communication  network  problems  [21],  [4],  models  of  growth  and  competition  [1], 
[61,  many  aspects  of  chemical  engineering  [81,  defensive  missile  allocation  [331,  and  nonlinear 
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resource  allocation  (37),  [38].  In  the  special  case  of  binary  variable  problems  there  are  applica¬ 
tions  to  capital  budgeting  [29],  sewage  treatment  and  irrigation  systems  [28],  media  selection 
[36],  cluster  analysis  [31],  scheduling  [19],  [30],  [2],  location  problems  [35],  [12],  [32],  and 
hydrological  studies  [12]. 

Cutting  plane  approaches  to  this  problem  have  been  suggested  by  Witzgall  [34],  and 
Granot,  Granot,  and  Kallberg  [16],  among  others.  Solution  methods  based  on  enumeration  or 
branch  and  bound  have  been  given  by  Laughunn  [25],  Hansen  [20],  McBride  and  Yormark 
(261,  and  Korte,  Krelle,  and  Oberhofer  [22]  -124],  Other  authors  have  taken  the  approach  of 
linearizing  to  a  linear  integer  programming  problem  and  using  existing  integer  algorithms. 
Papers  by  Glover  [13],  and  Glover  and  Woolsey  [14],  [15],  and  Granot  and  Granot  [17]  lie  in 
this  class.  The  present  paper  is  based  on  a  fourth  approach  to  nonlinear  integer  programming: 
using  dynamic  programming  methodology  while  mitigating  the  traditional  difficulty  dynamic 
programming  techniques  have  with  multiply-constrained  problems.  This  aim  is  referred  to  in 
the  literature  as  reduction  of  state  dimensionality  [3]. 

2.  HYPERSURFACE  SEARCH 

The  hypersurface  search  method  uses  discrete,  deterministic  dynamic  programming 
methodology  to  identify  integer  points  on  the  hypersurface  given  by  the  objective  function 
when  the  objective  function  value  is  set  equal  to  some  upper  bound  (in  a  maximization  prob¬ 
lem).  Initially  we  "bypass"  the  original  constraints  of  the  problem  and  utilize  dynamic  program¬ 
ming  recursion  formulas  for  the  following  problem  (P2)  which  is  based  only  on  the  objective 
function  of  (PI): 


n 


(4) 

max 

2  “  I 

subject  to 

(5) 

X  <  zo 

y-l 

(6) 

0  <  Xj  <  Uj  j  -  1 ,  . . 

•  ,  n 

(7) 

Xj,  integer  y-l,  . . 

.  ,  n. 

where  z0  is  an  upper  bound  on  z  and  ut  are  variable  upper  bounds.  We  complete  the  "forward 
pass"  of  dynamic  programming.  Then  integer  points  are  ordered  according  to  decreasing  value 
of  the  objective  function  using  some  efficient  sort  routine  (we  used  Quicksort),  and  they  are 
tested  separately  for  infeasibility  with  the  original  constraints  (2)  of  the  problem  (PI).  Each 
candidate  integer  point  is  developed  component  by  component  in  the  usual  "backward  pass"  of 
dynamic  programming.  But  if  we  have  one  or  more  separable  constraints  we  may  detect 
infeasibility  before  completing  the  backward  pass  at  the  partial  solution  stage.  Assume  that  in 
problem  (PI),  the  inequalities  (2)  are  replaced  by 

(8)  £  h,j(xj)  <0,  i—l . m. 

j- 1 


Then  a  point  can  be  eliminated  as  soon  as  any  new  component  gives  a  partial  sum 
(9)  £  hjxp  >  0 

y-* 
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for  any  constraint  i.  Here  we  are  assuming  a  separable  form  for  some  or  all  of  the  constraints 
and,  in  addition,  the  nonnegativity  of  constraint  coefficients.  This  algorithm  is  described  in 
Cooper  [9]. 

In  a  portfolio  application  [10]  we  have  implemented  several  fathoming  techniques  used  by 
Morin  and  Marsten  [27]  and  others,  and  generalized  by  Denardo  and  Fox  [11],  In  order  to  use 
fathoming  we  must  change  the  usual  calculation  of  the  recursive  equations.  It  is  replaced  by  a 
label-setting  technique  which  has  been  called  "reaching"  in  shortest  path  applications.  This 
allows  us  to  prune  the  entries  in  the  dynamic  programming  tables.  We  eliminate  partial  solu¬ 
tions  which  would  correspond  to  solutions  with  inferior  values  of  the  objective  finction,  and 
also  partial  solutions  for  which  any  complettion  would  be  infeasible.  The  corresponding  two 
types  of  fathoming  are  called  fathoming  by  bounds  and  fathoming  by  infeasibility.  The  con¬ 
straints  must  be  separable,  and  be  nondecreasing  functions  in  order  to  get  full  computational 
effectiveness. 

In  this  paper  we  extend  the  idea  of  fathoming  by  infeasibility  from  the  portfolio  applica¬ 
tion  to  the  general  problem  with  separable  objective  function  and  many  different  forms  of  con¬ 
straints.  We  investigate  linear,  aggregated  linear,  separable,  and  nonseparable  constraints. 

3.  LINEAR  CONSTRAINTS 

Computational  testing  was  done  on  random  problems  with  linear  and  squared  terms  in  the 
objective  function  and  linear  constraints.  The  motivation  for  these  tests  was  to  find  out 
whether  fathoming  by  infeasibility  with  every  constraint  is  superior  to  aggregating  some  of  the 
constraints,  and  generating  integer  points,  and  then  testing  the  solutions  with  the  disaggregated 
constraints,  as  in  [9].  Here  are  the  two  options  tested: 

(a)  Fathoming  by  infeasibility  with  each  constraint.  Then  all  solutions  at  the  end  of 
the  forward  pass  will  be  feasible  with  the  original  constraints.  Sort  according  to 
objective  function  value  to  find  the  optimum. 

(b)  Fathoming  by  infeasibility  with  aggregated  linear  constraints.  Several  constraints 
are  aggregated  and  the  resulting  constraint  is  used  to  eliminate  some  infeasible 
solutions  in  the  forward  pass.  Sorting  on  objective  function  value  is  done  and  a 
backward  pass  through  the  dynamic  programming  tables  tests  solutions  as  their 
components  are  found  for  feasibility  with  the  original  disaggregated  constraints. 

For  problems  with  special  structure  a  good  bound  may  be  found  so  that  fathoming  by 
bounds  can  be  implemented,  but  for  these  very  general  problems,  no  such  good  bound  can  be 
found  easily. 

4.  COMPUTATIONAL  RESULTS  FOR  LINEARLY  CONSTRAINED  PROBLEMS 

Ten  randomly  generated  problems  were  solved  for  each  of  three  sizes  of  problems  with 
nonlinear  objective  function  and  linear  constraints.  They  were  solved  by  Option  (a),  and 
results  are  given  in  Table  1. 
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TABLE  1  —  Quadratic  Separable  Objective  Function  with  Linear  Constraints 
Using  Fathoming  by  Infeasibility 


Problem  No. 

10  Variables 
and 

5  Constraints 

15  Variables 
and 

7  Constraints 

20  Variables 
and 

10  Constraints 

1 

.01 

4.31 

.02 

2 

.02 

3.86 

44.14 

3 

.01 

25.94 

16.95 

4 

.43 

79.49 

150.37 

5 

.11 

275.73 

.03 

6 

.1 

.02 

141.63 

7 

.29 

.02 

.02 

8 

.01 

.05 

36.63 

9 

.01 

.01 

.02 

10 

.01 

.01 

.05 

Average 

.1 

38.94 

38.99 

Standard 

deviation 

.15 

86.85 

58.74 

TABLE  2  —  CPU  Time  in  Seconds  for  Problems  With  Aggregated  Linear 
Constraints,  (x  Means  Time  Limit  Exceeded.) 


Problem  No. 

10  Variables 
and 

5  Constraints 

15  Variables 
and 

7  Constraints 

20  Variables 
and 

10  Constraints 

1 

.15 

4.05 

.06 

2 

.05 

26.91 

X 

3 

.01 

X 

X 

4 

1.7 

X 

X 

5 

1.89 

10.21 

.07 

6 

.36 

.09 

X 

7 

4.07 

.01 

.03 

8 

.01 

.04 

X 

9 

.01 

.01 

.02 

.09 

.01 

17.46 

Average 

.83 

5.17 

3.53 

Standard 

deviation 

1.34 

9.5 

7.79 
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From  these  results  we  see  that  the  CPU  times  on  a  CDC  6600  in  seconds  are  highly  variable. 
The  algorithm  was  efficient  on  some  of  the  largest  size  problems,  and  comparatively  good  for 
medium  and  small  problems.  Next  we  give  the  computation  times  for  the  Option  (b)  algo¬ 
rithm. 

These  are  identical  problems  for  which  the  constraints  have  been  aggregated  betore  fathoming. 
We  see  that  performance  has  deteriorated  significantly,  and  many  15  variable  problems  were 
not  solved  to  completion.  The  conclusion  is  that  fathoming  is  a  very  important  element  in  the 
hypersurface  search  algorithm,  and  if  special  structure  permits,  fathoming  by  bounding  would 
be  expected  to  improve  performance  times  if  it  is  implemented.  Obviously,  the  tightness  of  the 
constraints  or  bounds  is  the  key  factor  in  evaluating  the  resulting  improvement. 

5.  NONLINEAR  AND  NONSEPARABLE  CONSTRAINTS 

The  type  of  constraint  permitted  using  the  algorithm  described  above  is  very  broad.  Non¬ 
linear  and  even  nonseparable  constraints  are  handled  without  any  difficulty  in  forward  pass 
fathoming.  As  we  generate  the  forward  dynamic  programming  tables  we  can  look  backward  at 
any  stage,  k ,  and  trace  out  a  partial  solution  ....  x\.  Let  us  assume  that  the  constraint 

is  linear: 

a ^xi  +  a 2-<2  +  . . .  +  a„x„  <  b,  all  aj  >  0. 

If  we  have  reached  stage  k  when  we  do  the  fathoming  test,  we  are  testing  the  following  inequal¬ 
ity: 

X  aj(x?  >  b- 

J- 1 

It  is  easily  seen  that  if  the  constraint  is  nonlinear  and  separable ,  that  is, 

i  v.v/>  < b- 

/-i 

then  fathoming  test  at  the  kth  stage  would  be: 

£  hj(x')  >  b. 

j- i 


Therefore,  we  have  shown  that  the  forward  fathoming  method  can  be  implemented  for  one  or 
more  nonlinear  separable  constraints  with  no  additional  difficulty  in  logic  or  computation. 

Now  we  consider  the  logic  for  nonseparable  constraints:  crossproducts,  maximum  or 
minimum  expressions,  etc.  The  only  change  we  need  to  implement  in  the  forward  fathoming 
logic  is  that  a  partial  solution  is  not  tested  for  infeasibility  until  all  values  xj  involved  in  a  non¬ 
separable  term  are  developed.  For  example,  let  the  constraint  be: 

.V|.x2  +  max  (xjt4.v4)  <  b. 
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Instead  of  testing  for  infeasibility,  that  is  fathoming  at  stages  1,  2,  3,  and  4,  we  delay  testing 
until  stages  2  and  4.  At  the  second  stage  the  test  is 

Jf,*jr2*  b. 

If  not,  the  entry  corresponding  to  x2  =  x 2  is  deleted  from  the  second  stage  table.  No  test  is 
made  at  stage  three,  but  at  stage  four  we  test  the  inequality  again  with  another  partial  solution: 

x[x2  4-  max  (xj, 4xl )  ^  b. 


If  the  inequality  is  violated,  then  the  partial  solution  (xf.xj.xj.x^)  is  deleted  with  all  its  comple¬ 
tions.  Therefore,  the  only  effect  of  using  the  algorithm  on  nonseparable  constraints  would  be  a 
possible  reduction  in  the  efficiency  of  the  fathoming  algorithm  since  it  could  be  implemented 
only  for  stages  at  which  all  components  of  a  nonseparable  term  had  been  developed.  An  alter¬ 
native  approach,  which  seems  wasteful,  would  be  to  use  known  transformation  techniques  for 
factorable  functions  which  result  in  a  separable  form,  but  always  at  the  expense  of  added  vari¬ 
ables  and  constraints. 

CONCLUSION 

The  hypersurface  search  algorithm  has  been  improved  using  techniques  derived  from 
shortest  path  problem  algorithms  in  order  to  allow  a  reduction  of  computational  effort  in 
evaluating  the  functional  equations  of  dynamic  programming  and  in  reducing  the  number  of 
entries  or  states  in  the  tables.  We  have  presented  computational  results  based  on  these  tech¬ 
niques  for  linear  constraints  and  have  shown  how  they  can  be  used  with  no  additional  difficulty 
for  nonlinear  and  even  nonseparable  constraints. 
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ABSTRACT 


In  this  paper  we  introduce  a  discrete  state  level  crossing  analysis  and 
present  some  basic  results  and  a  key  theorem  of  level  crossings.  We  illustrate 
the  fertility  of  the  discrete  slate  level  crossing  analysis  by  applying  it  to  queue¬ 
ing  systems  with  (i)  bulk  arrival,  (ii)  instantaneous  feedback,  (iii)  limited  wait¬ 
ing  space,  and  (iv)  to  machine  interference  problems. 


1.  INTRODUCTION 

In  this  paper  we  define  several  event  epochs  associated  with  a  general  queueing  swem 
and  derive  relations  among  the  steady  state  probabilities  associated  with  these  event  civ.lis 
Often,  in  queueing  theory,  embedded  Markov  chains  tire  used  to  obtain  the  steady  stale  ;  •  :  a 
bility  distributions  of  the  number  in  the  system  at  arrival  or  departure  epochs.  Then  specific 
approaches,  suited  to  individual  queueing  problems,  are  used  to  relate  the  steady  state  probabil¬ 
ity  distributions  associated  with  different  event  epochs  such  as  arrival,  departure,  feedback,  and 
rejection  and  with  arbitrary  time  epoch.  For  example,  see  Chaudhry  [7],  Chow  [8],  Courtois 
and  Georges  [14],  Disney,  McNickle  and  Simon  [15],  Meyman  and  Stidham  [18],  Neuts  [321 
and  Takacs  [33], 

In  this  paper  we  present  some  basic  relations  among  these  steady  state  probabilities  and  a 
key  theorem  of  level  crossings  relating  the  steady  state  probabilities  at  arrival  and  dci  no 
epochs.  The  fundamental  idea  behind  this  key  theorem  is  the  relationship  between  the  i.  ;d 
downcrossings  of  regenerative  processes. 

Level  crossing  analysis  that  utilizes  the  relationship  between  up  and  down  crossings  of  sto¬ 
chastic  processes  with  continuous  state  space  is  introduced  by  Brill  [1],  Brill  and  Posner  [3.4,5], 
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and  independently  by  Cohen  [9,101.  Also  an  alternate  imbedded  level  crossing  analysis  is  dis¬ 
cussed  by  Brill  [21.  Shanihikumar  [23]  extended  Cohen’s  [9,10]  approach  to  alternating  regen¬ 
erative  processes  with  continuous  state  space  and  illustrated  its  applicability  to  several  queueing 
problems  with  server  interruptions  and  several  classes  of  customers  (see  [24-27]). 
Shanihikumar  [281  also  introduced  an  alternate  form  of  level  crossing  analysis  to  regenerative 
processes  with  continuous  state  space. 

All  the  approaches  mentioned  above  are  specifically  designed  for  stochastic  processes  with 
continuous  state  space  and  do  not  directly  extend  themselves  to  stochastic  process  with  discrete 
state  space.  In  this  regard,  we  introduce  a  new  level  crossing  analysis  that  is  applicable  to  sto¬ 
chastic  processes  with  discrete  state  space.  The  fertility  of  this  method  is  illustrated  by  a 
simpler  derivation  of  existing  results  for  some  queues  with  bulk  arrival  and  customer  feedbacks. 
New  results  are  derived  for  general  queueing  systems  and  a  machine  interference  problem. 
Further  application  of  this  approach  to  develop  new  and  efficient  algorithmic  solutions  to  a 
computer  system  model  is  illustrated  in  [30]  and  to  an  automatic  transfer  line  model  in  [32). 
Also,  using  this  discrete  state  level  crossing  analysis  a  new  hybrid  simulation/analytic  model  is 
developed  for  a  computerized  manufacturing  system  (see  [31]).  See  Shanthikumar  [29]  for  a 
recent  application  of  this  method  to  a  single  server  queue  with  server  failures  and  state- 
dependent  arrival  process.  In  the  past,  some  basic  exploitation  of  this  idea  of  discrete  state 
level  crossing  analysis  has  been  carried  out  by  Cooper  [11],  Gross  and  Harris  [17],  and  Jain  and 
Seth  [19], 

In  Section  2  we  present  some  basic  relations  and  the  key  theorem  of  level  crossings.  We 
illustrate  the  fertility  of  this  level  crossing  analysis,  in  Section  3,  by  applying  it  to  queueing  sys¬ 
tems  with  (i)  bulk  arrival,  (ii)  instantaneous  feedback,  (iii)  limited  waiting  space,  and  to  (iv) 
machine  interference  problems. 

2.  THE  BASIC  RESULTS 

In  this  section,  we  will  derive  some  basic  results  and  a  key  theorem  of  level  crossings  for 
a  special  case  of  regenerative  process,  which  occur  in  many  queueing  systems.  These  results 
will  be  used  later  to  analyze  some  queueing  systems. 

The  Regenerative  Process 

A  regenerative  process  [XU),  i  >  0)  is  a  stochastic  process  which  starts  anew  probabilist¬ 
ically  at  an  increasing  sequence  0  ^  R\  <  R2  <  ...  of  random  epochs  on  the  time  axis 
[0,  °°).  Thus,  between  any  two  consecutive  regenerative  epochs  R,  and  R,+\<  the  portion 
{X(t).  R,  <  r  <  /?,  + 1 )  called  regeneration  cycle,  is  an  independent  and  identically  distributed 
replicate  of  the  portion  between  any  other  two  consecutive  regeneration  epochs  [9,13].  We 
represent  the  time  intervals  between  any  consective  regeneration  epochs  by  the  set  [7]  /  ^  1] 
where  T,  =  /?,*,  -  R,,  and  assume  E(Tt)  <  °°  for  all  values  of  i  >  1. 

The  process  {X(t),  r  ^  0}  is  assumed  to  have  a  state  space  ,V+  of  nonnegative  integers 
and  to  have  sample  paths  which  are  continuous  from  the  right  and  which  make  only  a  finite 
number  of  jumps  in  each  finite  time  interval.  In  almost  all  queueing  systems,  the  epochs  when 
customers  arrive  to  find  all  servers  idle  constitutes  a  sequence  of  regeneration  epochs.  If  XU) 
denotes  the  number  of  customers  at  time  t.  i  ^  0,  then  a  sample  realization  of  this  process  can 
be  represented  by  Figure  1.  In  the  queueing  context,  an  accepted  external  arrival  of  a  batch  of 
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Regeneration  Cycle 


Figure  1  A  sample  realization  of  the  regenerative  process  XU) 


customers  at  the  system  is  represented  by  an  upward  jump  and  a  departure  from  the  system  by 
a  downward  jump.  The  magnitudes  of  the  upward  and  downward  jumps  are  assumed  to  be 
independent  and  depend  only  on  the  level  (that  is,  the  state)  of  X(t)  just  before  the  upward 
and  downward  jumps,  respectively.  The  magnitudes  of  these  upward  and  downward  jumps 
from  level  ware  assumed  to  have  distributions  \gn(k),  k  >  1)  and  {/„(&),  k  >  1)  with  finite 
means  E(Xn)  and  E(Yn ),  respectively.  Wc  allow  for  the  possibility  of  rejections.  That  is,  an 
external  arrival  of  a  batch  of  customers  seeing  a  ievel  n  may  be  rejected  (that  is,  entry  to  the 
system  is  denied)  with  probability  gn.  These  epochs  of  rejections  are  marked  by  stars  in  the 
sample  path.  Similarly,  feedback  is  also  allowed  (see  Figure  2).  That  is,  customers,  after 
service  completion,  rejoin  the  queue  with  probability  /„,  for  more  service  instead  of  leaving  the 
system.  These  epochs  are  represented  by  hollow  circles  in  the  sample  path.  Note  that  neither 
an  upward  nor  a  downward  jump  occurs  at  the  rejection  or  feedback  epoch. 

Let  the  sets  U,  D,  R  and  F  contain  the  epochs  of  upward  jumps,  downward  jumps,  rejec¬ 
tions  and  feedbacks,  respectively,  within  the  first  regeneration  cycle  during  [R\,  /?2).  Then 
A  =  U  U  R,  C  =  £>  U  F,  and  /  —  U  U  Fare  the  sets  of  external  arrival  epochs,  service  com¬ 
pletion  epochs  and  input  epochs  to  the  system  including  feedback,  respectively.  Now  for  each 
level  n  of  T(r),  define  U~,  D~ ,  .  F~ ,  A~,  C“,  and  l~  as  the  sets  of  epochs  of  upward 

jumps  from  level  n,  downward  jumps  from  level  n.  rejections  at  level  «,  feedback  at  level  w, 
arrivals  to  see  level  w,  service  completions  at  level  «.  and  inputs  to  see  level  n,  respectively, 
during  the  first  regeneration  cycle. 

Similarly  define  U* ,  D* ,  R*.  F* ,  A „+,  C„+,  and  /„*  as  the  sets  of  epochs  of  upward 
jumps  to  level  w,  downward  jumps  to  level  n  rejections  at  level  w,  feedback  at  level  n  after 
arrivals,  service  completions  to  level  w,  and  level  n  after  inputs,  respectively,  during  the  first 
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regeneration  cycle.  The  sets  E~  and  £„+  basically  represent  the  epochs  at  which  the  level  is  n 
just  before  and  just  after  the  occurrence  of  the  event  e,  for  e  6  {u,  d,  r,  f  a.  c,  /)  A  5,  gener- 
ically  associated  with  the  set  E  €  [U,  D,  R,  F.  A,  C.  /}  of  events  as  defined  above.  Note  that 
R;  =  Rn\  and  f;  =  f;. 


Let  eTk  be  the  time  of  the  fcth  occurrence  of  event  e.  Then  if  #  \E)  is  the  cardinality  of  a 
set  E  and  8(0  is  the  indicator  function,  we  have 

r'A  #{k:  K,  <  eTk  <  R2\  =  #{£) 

er;  A  Z8(X(eTk-)  =  n)  -  #{£"1. 

*-i 


and 


„t;a  £5UU+)-»)-#(a 

*-l 

when  e  and  E  are  generically  matched.  Now  we  will  define  the  associated  probability  distribu¬ 
tions  (see  Figure  2) 

p~(n)  =  lim  P{X(t,Tk-)  =  «) 


and 


p ;(n)  =  lim  F( X (,, Tk  )  =  n\. 

k  °° 


The  existence  of  these  distributions  is  guaranteed  by  the  regenerative  property  of 
lX(t),  i  ^  0).  From  the  mean  value  representation  for  the  regenerative  processes,  we  have 


Pe  (») 


T|,.t„  } 
£{r'| 
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and 


/v+(«) 


E{eTt\ 

E\r'} 


Based  on  the  earlier  observation  that  =  Rf  and  F^  =  F* ,  one  has 

p/(n)  =  p/in)  ^  p/(n),  n  ^  1 
and 

p~in )  =  p/in)  4  p,in),  n  ^  0. 


Relationships  between  pfin)  and  p/in)  are  an  immediate  consequence  of  the  above  "stochas¬ 
tic  mean  value"  equations.  For  example,  we  have  the  following: 

(1)  p/in)  =  £  P/ii)  g/n  -  /'HI  -  ft)  +  Pa(n)g„,  n  >  0 

i-0 

(2)  p/in)  =  £  A/O)  ft(n  -  i).  n  >  0 

i-0 

(3)  /»f+(w)  =  =  pH”)*  n  ^  o 

(4)  Pfin)  —  p/(n)  -  n  >  1 

(5)  p/(n)  -  X  P/H)  /<(i  -  n)  n  >  0 

/-#+! 

A+(»)  -  £  Ar"(»)  //('  ~  "HI  ~  //)  +  P/in)f„,  n  >  1 

/-«+i 

(6)  -  £  p~ii)  /,(/)( 1  -  /,),  «  -  0 


Let  be  the  rate  of  occurrence  of  events  of  type  e.  That  is, 

£{T'}/£{rt),  Vr6S. 

Using  the  relationship  between  the  rates  of  occurrence  of  different  event  epochs,  we  will 
derive  the  following  relations: 


(7) 

(8) 

(9) 

(10) 


Pu  in)  =  pa  (n)(l  -  ft,)/ 


l  -  £  Pa  («>«,}•  «  >  0 

n- 0  ' 


ftH«)  -  Pa  in)gj\  in)g„ 


n“0 


Pd  in)  -  £.“(«)(  I  -  /„)/ 


1  “  £  ftc  («)/, 


1 


,  n  >  0 
,  //  >  0 


/y(»)  -  /?f  («)/„/ 


X  Pc  in)f„\.  n  >  0 

n-1 
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Let  the  sets  U~,  D~ ,  R~,  F~,  A~ ,  C~ ,  and  l~  be  as  defined  earlier.  Now  define 
(generically  subscripting)  r~(n),  r^in),  r,~(rt),  rf(rt),  r'(n)  ,  r~(n),  and  r,“(n)  as  the  rates 
of  upward  jumps,  departure,  rejections,  feedbacks,  arrivals,  completions,  and  inputs  with 
respect  to  level  n  of  the  X(t)  process,  such  that, 

(11)  /•,'(«)  =  E{eT~)/E(Tx),  n>  0, 

for  e  £  S  A  {«,  d,  r,  f,  a,  c,  /}  and  E  €  [U,  D,  R,  F,  A,  C,  /),  where  the  pair  (e,E)  is 
chosen  to  match  generically.  Note  that 

^“(0)  =  0  for  e  c  {/,  c,  d\ 

oo 

and,  since  U  E~  =  E, 

«- o 

(12)  rt  =  T  r~(n),  We  €  S. 

«to 

Then  from  the  "stochastic  mean  value  equation"  for  Pein),  we  have 

(13)  Pf(n)  =  r~(n)/rt,  V  e  €  5. 

Therefore,  the  interrelationships  between  these  probabilities  can  be  derived  using  the  relation¬ 
ship  between  the  rates  r~(n),  e  €  S.  Since  the  probability  of  rejection  of  a  batch  on  its  arrival 
is  g„  if  it  sees  n  in  the  system,  we  have 

(14)  /•„'(«)  =  r~(w)(l  -  g„),  n  >  0 
and 

(15)  rr~~(n)  =  r~(n)g„,  n  >  0. 

Then  from  (12),  (13),  and  (14),  equation  (7)  follows.  Similarly,  from  (12),  (13),  and  (15), 
equation  (8)  follows.  A  similar  inspection  of  completion  epochs  gives 

(16)  /•/(«)=«  r~(n)(\  — /„),  n  >  0 
and 

(17)  rf(n)  =  r~(n)f„,  n  >  0. 

Then  similar  to  the  above  cases,  equations  (9)  and  (10)  follow  from  (12),  (13),  (16),  and  (17). 

Note  that  when  g„  =  g,  V  n  >  0,  from  (7)  and  (8)  pa~(n)  —  Puin)  =  pr(n),  and  simi¬ 
larly  when  the  feedback  probability  /„  is  independent  of  rt,  that  is,  f„  —  /,  V  n  >  1,  from  (9) 
and  (10),  p~{n)  —  pj(n)  =  p/(n). 

Next  we  will  establish  the  following  relations: 


(18) 


/?/(«)  + 

fn 

Pd  (n) 

l~fn 

oo 

'  + 1 

1 

fn 

Pd~(") 

1  -  fn 

»  • 


»  • 


» 


I 


I 


I 
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Pf(n) 


Pu  ( n)ru  +  rd 

fn 

1  -/„ 

ft*  (n) 

ru  +  rd  X 

rt- 1 

/. 

1  -/» 

Pd  {n) 

,  n  Ss  1 


Pu(n)ru  +  rd  y ~~  pd  ( n ) 

(20)  p,+  (/r)  =  - - - ,  n  >  1 

ru  +  rdj^  i  ^  f  Pd (w> 
n-1  1  A 

where  ru  and  ^  are  the  rates  of  upward  and  downward  jumps  as  defined  earlier. 

Now  as  defined  earlier,  let  Uf,  I* ,  D+,  and  C„+  be  the  sets  consisting  of  epochs,  just 
after  an  upward  jump,  input,  downward  jump,  and  completion  when  X(t)  =  n.  That  is, 

E+-  {/:  *(r+)  =  n,  t  €  E), 


E  €  [U,  I,  D,  C}. 

Now  generically  define  the  rates 

(21)  re+(n)  =  £{fT^}/£{7’1},  e  €  {u,  i,  d,  c),  n  ^  0 

oo 

and  since  (J  E„+  5  E,  we  have 

/I-O 

oo 

rt  -  X 

rt— 0 

Then,  as  in  the  previous  case. 

(22)  ft+(/i)  =*  re+(n)/re,  e  €  {«,  i,  d,  c}. 

Note  that 

(23)  r+(n)  =*  ry(«)  +  rd{n), 

and  therefore,  from  (21),  (22),  and  (23)  we  get  equation  (18),  after  noting  from  (16)  and  (17) 
that  rf(n)  -  rd(n)f„l(  1  -  /„).  Similarly,  the  relation 

(24)  r~(n)  -  r~(n)  +  rf(n) 
and 

(25)  r,+  (/j)  **  rf(n)  +  rf(n) 

along  with  rf(n)  -  rj(n)f„/(  1  -  /„),  (21)  and  (22)  lead  to  (19)  and  (20). 

Equating  the  number  of  customers  arrived  and  departed  during  a  regeneration  cycle,  we 


X  ru~(n)E(Xn)-  £  r-(n)E(Yn). 

n-0  n-1 
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These  relations  (1),  (2),  (3),  (7),  and  (8)  form  one  class  such  that  if  p0“(n),  n  >  0,  is  known, 
we  can  derive  pf(n).  pj(n).  p„(n),  and  pr{n).  Similarly,  the  relations  (4),  (5),  (6),  (9), 
(10),  and  (18)  form  a  second  class  such  that  if  p^in),  n  >1,  is  known  we  can  obtain 
Pj(n),  p,(n ),  p~(n),  and  p+(n).  However,  to  relate  these  two  sets  of  probabilities  and  to  use 
(19)  and  (20),  we  need  a  relation  between  two  probability  distributions,  one  from  each  set. 
Such  a  relation  is  our  key  theorem  of  level  crossings,  which  we  present  next. 

THEOREM  1:  The  Key  Theorem  of  Level  Crossing 

The  probability  distributions  pj(n)  and  pj(n)  are  related  to  one  another  by: 

(27)  ru  2>-(/)(l  -  G,(n  -  /))  -  rd  £  />/</)<  1  -  F,U  -  n  -  D).  n  >  0 

i~0  i-n  +  l 

where 

G,(k )  -  T  *,(/>.  G,(0>  =  0  and  F,(k)  -  T  /O').  F,(0)  =  0 

j- i  >-« 

are  the  cumulative  distributions  of  g,(-)  and  /(•),  respectively. 

PROOF:  Define 


#  U„  -  #(/.  T(D  <  n.  X(t* )  >  n,  t  €  C/|,  the  number  of  uperossings 

over  level  n  from  and  below  level  w,  and 

#  £>„  =  #{/:  XU~)  >  n  +  1,  X(t+)  <  »  +  1;  /  €  0),  the  number  of 

downcrossings  below  level  «  +  1  from  and  above  level 
n  +  1,  by  the  process  (JO),  /  >  0}  during  its  first 
regeneration  cycle. 

A  simple  inspection  of  the  sample  path  (see  Figure  1)  will  show  that 

(28)  #  f/„  -  #  D„  a.s. 

From  the  definition  of  #  C/„ ,  it  is  clear  that 

£{#£/)  -  £|  £  SUOr/)  <  n  <  X(uTk)  +  Xk)  , 

where  Xk  is  the  batch  size  at  the  Ar-th  uperossing  in  the  first  regeneration  cycle.  Since 
f>(X(uTk~)  <  n  <  X(uTk)  +  X*)  -  »(*(,,  Tk)  *£  n) 

-  h(X{uTk)  +  Xk  «), 
from  the  above  equation  for  £(#{/„},  we  get 

£{#{/,)  -  E  £  8a(u7;-)  </»)-£  6(X(uTk  )  +  Xk  <  n) 

(*-o  *-o 

Now  from  the  "mean  value  equations"  for  p~(n),  pe+(n ).  and  from  (2),  we  have 

(29)  E\#Un)  -  £{r“}  /»„-(/)(!  -  G,(«  -  /)). 
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Using  a  similar  derivation,  we  can  show  that 

(30)  E{#D„)  =  EM  £  £,-(')(!  ~  Ft(i  -  n  -  D). 

/“/l  +  l 

Now  from  (28),  equating  (29)  and  (30)  and  dividing  by  £(£,),  we  get  (27). 

COROLLARIES: 

Next  for  a  special  case  of  the  X(l)  process  we  will  relate  the  up  and  downcrossing  rates  to 
the  steady  state  probability  q(  )  at  an  arbitrary  time.  The  special  case  considered  is  a  piecewise 
Markov  process  (see  Kuczura  (21)).  For  this,  we  assume  that  there  exists  a  sequence  of  ran¬ 
dom  variables  {  Wn) o°  with  0  <  IF,  <  IV2  <  ...  ,  such  that  for  any  n  >  1, 

(/)  tf(/>.  K-i  <  '  <  W»\. 
with  W0  =  0,  is  a  Markov  process 

(//>  PiJ  =  P[X(W„)  -  j\X(W~)  =  /}. 
is  the  same  for  all  n  ^  1,  and 

(///>  Qk(u)  -  P{Wn  -  Wn_,  «|Jir(H^_l)  =  k }, 
is  a  family  of  distribution  functions  of  positive  random  variables. 

CASE  1:  Let  us  first  assume  that  the  process  {Y(/),  /  ^  0)  is  piecewise  Markovian 
between  two  successive  downward  jumps  such  that, 

£{upward  jump  during(/,  /  +  A/JlAT/)  =»«}=»  \(n)tkt  +  0(A/), 

VO  0,  where  lim  — ^  =  0,  is  independent  of  time  /  and  depends  only  on  the  state  of 
X(t).  From  equation  (27)  and  the  rate  of  upcrossings,  we  then  have 

COROLLARY  1: 

£\(/)?(/)(  1  -  G,(n  -  /))  =  />"(/)(  1  -  G,(n  -  /)) 

i-0  <- o 

(31)  =  o  £  prf-(/)(l  -  F,(i  -  n  -  D) 

/— f»— 1 

CASE  2.  For  this  case  we  will  assume  that  A'(r)  is  a  piecewise  Markov  process  between 
successive  upward  jumps  so  that, 

Pfdownward  jump  during!/,  /  +  A/)jY(/)  =  n)  =  ^(«)A/  4-  0(A/), 

«  >  1,  is  independent  of  /and  depends  only  on  X(t).  Similar  to  Case  1, 

COROLLARY  2: 

]T  n(i)q(i)(  1  -  F,(i  -  n  -  D)  =  /7U  (/)(1  -  G,(n  -  /)) 

/-n  +  l  ,-o 

(32)  =  /v  £  AT ('><>  -  £,(/  -«-)) 

f  —  +  I 


»  -• 


I  • 


»  • 


I  • 
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In  the  next  section  we  will  illustrate  the  fertility  of  these  results  by  applying  them  to 
several  queueing  examples. 

3.  APPLICATIONS 


3.1  Queueing  Systems  With  Bulk  Arrival 


Let  us  consider  a  queueing  system  to  which  the  arrivals  are  in  batches  of  size  B.  Let 
P(B  —  n)  =  b„,  n  ^  1.  It  is  assumed  that  the  interarrival  times  from  a  renewal  sequence  with 
mean  1/A .  The  departures,  however,  occur  singly  such  that  the  magnitude  of  a  downward  jump 
is  one.  It  is  assumed  that  all  arrivals  are  accepted.  However,  no  specific  assumptions  are  made 
regarding  the  feedback.  A  special  case  of  this  system  is  a  Glx/G/c  queue  with  the  restriction 
that  departures  occur  singly.  Now  from  (26)  and  (12),  we  get 

(33)  ru  =  A.  and  rd  =  kE(B) 

Now  using  (33)  in  (27),  after  noting  that  p~(n)  =  pd(n)  and  pj (n  +  1)  =  pd(n), 

(34)  £/>-(/)(  1  -  H(n  -  /))  -  E(B)pd(n) ,  n  >  0, 

where 


H(k)  -  £6,  =  P(B  <  k),  H( 0)  =  0. 

j- 1 

Now  taking  the  moment  generating  function  of  (34)  we  get 
Pa-(z)(l  -  H(z)) 


(35) 

where 


(1-  z) 


E(B)Pd(Z), 


Pa  -  Z  z'prin),  P/(z )  -  £  zV(«>. 

h— 0  zj—0 

and 


//(z)  -  £  z"6n 

n-l 

are  the  moment  generating  functions  of  (•).  p/(-)  and  6„.  This  result  relating  the  steady 
state  probability  distribution  seen  by  an  arriving  batch  and  a  departing  customer  immediately 
extends  the  key  result  (equation  (D)  derived  by  Chaudhry  [7]  for  an  Mx/G/\  queue  to  systems 
even  more  general  than  a  GI*/ G/c  feedback  queue.  Now  let  us  consider  two  special  cases  of 
the  above  model:  (i)  Glx/ Ml c  and  (ii)  Mx/G/c. 


3.1.1  Glx/M/c  Queue 

Let  the  service  times  be  exponentially  distributed  with  mean  \/p.  and  the  feedback  proba¬ 
bility  /„  —  /,«>  1.  Then  from  corollary  2,  with  p,(n)  -  min(n,c)/a(l  -  /),  and  (34),  we  get 

(36)  A  />„-(/)(  1  -  H(n  -  /))  -  min(/i  +  l,c)*t(l  -  f)q(n  -I-  1) 

Result  (36)  is  a  generalization  of  the  results  given  in  Takacs  [33 J  for  the  Gl/M/c  gueue. 
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3.1.2  Mx/G/c  Queue 

Here  we  assume  that  ihe  arrivals  of  batches  form  a  Poisson  process  with  rate  K.  Then 
from  Corollary  1  we  have  q(n)  =  Pi(n)  and  from  (35)  we  gel 

(37)  Q(z)  =  P;(z)  =  (£(fi)(l  -  z)|P/(z)/(l  -  H(z)), 

oo 

where  Q(z)  =  £  z"q(n),  and  others  are  as  defined  earlier.  Result  (37)  was  derived  by  Chau- 

rt— 0 

dhry  [17]  (Equation  (D)  for  the  A/*/ (7/1  queue. 

Similar  results  can  be  derived  for  the  bulk  service  queues. 

3.2  Queues  With  Instantaneous  Feedback 

Now  we  will  consider  a  queueing  system  with  single  arrivals  and  departures.  That  is, 
gn(\)  =  f„(l)  =  1  V«.  We  will  also  assume  that  the  probability  of  blocking  g„  is  zero  and  pro¬ 
bability  of  feedback  /„  is  a  constant  equal  to  /,  V/i  >  1.  A  special  case  of  this  system  is  the 
G//G/c  queue  with  instantaneous  feedback.  Let  the  mean  inter-arrival  time  be  l/\.  Recently 
Disney,  McNickle  and  Simon  [15]  considered  the  M/G/\  queue  with  instantaneous  feedback. 
They  obtained  relationships  between  the  probabilities  at  different  event  epochs  using  Markov 
Renewal  theory.  Our  result  will  generalize  theirs.  From  (26)  and  (12),  we  have  ru  —  rd.  Now 
from  the  key  theorem  of  level  crossing,  and  noting  that  pd(n)  *  Pi(n), 

(38)  Pi(n)  -  Pi  (n  +  1)  =»  pd(n),  n  ^  0, 

Now  from  (18)  and  (38), 

(39)  Pf+(n)  “*  (1  -  f)Pd(n)  +  f  p/(n  -  1),  n  >  1. 

From  (4),  (9),  and  (10), 

(40)  pf{n)  *=  pi (n)  =*  pi (n),  n  >  1. 

Also  from  (6), 

pc+  ( n )  —  pi  ( n  +  1 )  ( 1  -  /)  4-  pi  ( n  )f  n  >  I ) 

(41)  =  p ~ ( l ) ( 1  -  /),  n  =  0  j 

So  having  evaluated  /»/(■)  from  embedded  Markov  chain  analyses,  the  remaining  probability 
distributions  can  be  evaluated  using  (38)-(41). 

3.3  Queueing  Systems  With  Limited  Waiting  Space 

In  this  section  we  consider  the  same  model  as  3.2  except  that  the  capacity  of  the  system  is 
K.  That  is,  g„  *  I,  n  >  K  and  —  0,  n  <  K.  Then  from  (7)  we  get 

(42)  pjn)  =  pi(n)/(  1  -  pi(K)),  0  n  <  K 

and 

Pi(n  +  1)  —  Pi(n)  ”  0,  n  ^  K. 

Now  from  the  key  theorem  of  level  crossings  and  noting  that  ru  -  rrf, 

(43)  Pi(n)  =  Pi(n  +  1),  0  <  n  <  K. 

Next  we  will  consider  two  special  cases. 
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.U  /  (,1/M/c/K  Queue 

I  rum  Corollary  2  we  gel,  after  noting  that  p(/i>  =  min(«,c)p(l  —  /),  n  =  1.2 . K , 

*44)  r„pu  (//  —  1)  =  minGr.c )p  ( I  -  f)q(n),  /;  =  I,  2 . A". 

I  n  .0  *42 )  and  noting  that  ru  =  A ( 1  —  p~(K)),  we  get 

*45)  \p~  (n  -  1)  =  min  (w,c)p(  1  —  f)q(n) ,  n  =  1,2,  . . .  ,  A7 

1  he  above  result  agrees  with  that  of  Hey  man  and  Stidham  (18]. 


J.J.J  M/G/l/K  Queue 


I  rom  (42),  (43),  and  Corollary  1  we  get 

*46)  Kq(n)  rup~{n),  n  -  0,  1,  2 . K  —  l, 

-  GP/(n  +  1). 

Now  noting  that  rd  =  p(l  —  <7 (0) ) ,  we  get  from  (46) 


(47)  qin)  — 

where  p  —  A/p.  So 

4r(0)  = 

*48)  </(«)  = 

and 


(1  —  <7  (0) ) («  +  1) 


.  n  -  0,  1,  2,  ...  ,  K  -  1, 


p/(0) 

p  +  p/(0) 

1 


p  4-  pd+( 0) 


p/(«).  n  =  1.  2 . /C  —  1 , 


<?(/0  -  1  -  *£  </(«) 

»- 0 

give  the  steady  slate  probability  distribution  at  an  arbitrary  time  epoch.  See  Gross  and  Harris 
(17.  page  252],  for  an  alternate  derivation  of  the  same  results. 


4.  MACHINE  INTERFERENCE  PROBLEMS 


In  this  section  we  apply  the  level  crossing  technique  to  machine  interference  models 
w  Iik  n  are  queueing  models  with  finite  capacity  sources,  and  derive  some  new  results  for  this 

model. 

4.1  M/Ci/1/  /N  Queue 

'Ac  consider  here  a  queueing  model  with  a  single  finite  source  model  of  capacity  .V  and  a 
small  server.  The  arrival  rate  of  customers,  when  there  arc  n  customers  at  the  service  facility, 
is  a  *  v  -  n)  Besides,  =  I  when  n  ^  ,V  and  0  when  n  <  ,V,  and  pa~(n)  =  0  if  n  >  N.  So 
we  Mom  (7) 

Pu  <  n )  =  Pa  ( >1 )  ■  0  5?  n  <  .V 

,  4 1  j  • 

=  0  Otherwise. 


N  W  VI  RESEARCH  LOGISTICS  OUARTLRl  Y 


VOL  29.  NO  4.  DEC  EMBER  1982 


LEVEL  CROSSING  ANALYSIS  IN  QUEUEING  SYSTEMS 


605 


from  (26),  ru  =  rd. 

From  Corollary  1,  and  because  of  (49),  we  get  for  0  <  «  <  /V 
(50)  A in)  qin)  =  rup~(n) 

=  rd  pj  ( /;  +  1). 


Let  1/p  represent  the  mean  service  time  of  a  customer,  and  p  the  utilization  of  the 
server.  Then, 


^  =  MP- 


Because  pd  (n  +  1)  =  pd(n),  we  get  from  (50),  for  0  <  n  <  N, 

Kin)  qin)  =  pppj(n). 

So 


and 

(51) 


,  ,  MP  P/(«>  ,  A  ^ 

qin)  =  - ; — - —  for  0  <  n  <  N 

Kin) 


.v—  i 

q(N)  —  \  —  £  qin). 

n— 0 


These  results  agree  with  equation  (15)  of  Courtois  and  Georges  1 1 41  and  equation  (10)  of 
Chow  [81. 

4.2  M/G/1//N,,  N2,  ...Nk  Queue 

We  now  extend  the  results  to  a  multiple  finite  source  queueing  model  with  K  finite 
sources  and  a  single  server.  The  capacity  of  the  /  th  source  is  N,.  The  server  can  follow  any 
work-conserving  service  discipline  for  selecting  customers  for  service.  Relations  are  obtained  in 
the  following  subsections  with  respect  to  marginal  and  joint  probabilities. 

4. 2. 1  Marginal  Probabilities 

We  consider  each  source  separately  and  apply  results  obtained  in  Section  4.1.  We  define 

Pd*(nk)  —  lim  />(T*(f,+  )  -  nk) 

r—ao 

where  Xkit)  denotes  the  number  of  source  k  customers  present  at  time  t,  t  ^  0,  and  £r  is  the 
service  completion  epoch  of  the  rth  type  k  customer.  So  from  (51)  we  get,  for 
k  -  1,  2 . K, 


and 

(52) 


'  Pd*  if  0  nk  <  Nk 


Nk  -  1 

qk(Nk)  -  1  -  £  qk(i) 
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where  qk(nk )  is  the  marginal  time  average  probability  of  finding  nk  number  of  source  k  custo¬ 
mers  at  the  facility,  pk  is  the  proportion  of  time  the  server  is  busy  with  source  k  customers  and 
1  /fj.k  is  the  mean  service  time  of  a  source  k  customer.  Then  rd  =  nkpk  for  class  £  source. 

4. 2. 2  Joint  Probabilities 


In  this  case,  we  represent  the  state  of  the  system  X(t)  as  a  vector  n  =  (nx,  n2 . %), 

where  nk  refers  to  the  number  of  source  k  customers. 


From  an  extension  of  Corollary  1,  for  the  multidimensional  discrete  state  processes,  we 

A  A 

can  show  that,  for  the  compound  state  (i;  ij  4  njt  j  =  1,  2,  ....  K }  with  £  n2  ^  £  Nj, 

j- t  J- i 


£2  Kk(nk) 

k- 1 


-l,  n2  "k-1  nk+l 


j  £  '  £  22  ' "  1 9  '2.  —  4-t.  nk>  4+1-  •  •  ■  -  '*) 

/|-Oi2-o  'ji-i-0 'k~° 


(53) 


=  J^PkPk 

k- 1 


"1  "2  "*-t  "*•;)  "*  + 

£  £  £  £  £>a  (/,. /2. ....  4-i.  . 4r) 


where  pjj+in)  =  lim  P(.Y(f*)  =  n). 

r— °° 

Rearranging  the  terms,  (53)  becomes 

a  + 

q(n)k(n)  =  £  Pd  ^PkPk  ~  £2 
*-i 


(54) 


(55) 


AT 

I 

*-I 


n,  n2  nk  +  . 


II  I  I  I  (<?(<i-  h . 4-1.  nk •  4+t . ‘x 

'l*"0  <2—0  ik_ |  'k4.|“0  'Af”° 


-  Pd*(i\.  h . 4-1-  nk>  4+i.  •••»  'aWp*) 

A  A 

!'/*£>; 

./-i  /-i 


<j(/Y,,  Afj . Nk)-  1- 


,v,  v2 


£  £  ...  £2  q(n,,  n2 . njc> 


i «,  *  i  *k- 

k-\  k- 1 


Details  of  the  numerical  algorithms  using  the  above  relations  for  calculating  the  time  average 
probabilities  q(n)  are  given  in  [6].  A  detailed  derivation  of  these  and  other  results  with 
numerical  examples  will  be  reported  elsewhere. 
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5.  CONCLUSIONS 

In  this  paper  we  have  presented  some  basic  results  concerning  level  crossing  analysis 
applied  to  discrete  state  processes.  By  applying  these  results  to  some  queueing  systems,  we 
obtained  relations  between  lime  average  probabilities  and  the  steady  state  probabilities  at  arrival 
or  departure  epochs. 

The  queueing  systems  considered  in  this  paper  are  but  a  few  among  the  vast  number  of 
cases  where  the  level  crossing  analysis  can  be  used  to  relate  the  steady  probabilities  associated 
with  different  event  epochs.  The  most  important  feature  of  this  analysis  is  that  the  level  cross¬ 
ing  analysis  is  a  more  general  and  simple  tool  which  can  be  applied  to  many  queueing  systems. 
Application  of  this  discrete  state  level  crossing  analysis  has  already  proven  to  be  very  fruitful  in 
different  cases  (see  (29-32]). 
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ABSTRACt 

A  simple  formula  is  found  to  be  just  as  accurate  as  a  complicated  one  for 
estimating  the  probability  of  detection  achievable  by  an  ingenious  searcher  pa-  # 

trolling  a  channel  or  barrier.  The  difference  between  "detection"  and  "closure" 
is  emphasized  u  an  extension. 


INTRODUCTION 

A  searcher  whose  speed  is  v  and  who  can  see  any  target  within  some  distance  R  patrols  a 
channel  of  width  L  looking  for  targets  that  come  down  the  channel  with  speed  u.  By  "patrols," 
we  mean  he  selects  any  closed  path  and  travels  around  it  repeatedly;  the  endurance  of  the 
searcher  is  not  an  issue.  The  question  is,  "What  is  the  best  closed  curve,  and  what  fraction  of 
the  targets  are  detected?"  If  v  <  w,  it  is  also  of  interest  to  find  the  fraction  of  targets  that  can 
be  closed  (touched)  by  the  searcher,  assuming  that  targets  continue  to  move  in  a  straight  line 
down  the  c'tannel.  The  problem  is  an  old  one,  with  the  classic  reference  being  [1],  In  most 
naval  applications,  one  role  or  the  other  is  played  by  a  submarine.  The  "channel"  sometimes 
corresponds  to  that  portion  of  a  wide  barrier  that  has  been  assigned  to  a  particular  searcher. 

Our  approach  to  this  problem  will  be  to  first  show  that  there  is  an  upper  bound  on  the 
probability  of  detection,  corresponding  to  a  particular  infeasible  patrol  path  for  the  searcher. 
We  will  then  argue  that  there  are  feasible  patrol  paths  for  which  the  probability  of  detection  is 
quite  close  to  the  upper  bound.  Finally,  we  shall  provide  a  simple  geometric  analysis  of  the  clo¬ 
sure  problem  based  on  the  infeasible  patrol  path. 

AN  UPPER  BOUND  ON  THE  DETECTION  PROBABILITY 

It  is  convenient  to  imagine  that  the  targets  are  fixed  to  a  tape  moving  down  the  channel 
with  speed  «,  in  which  case  the  probability  of  detection  for  randomly  distributed  targe.s  is  just 
the  ratio  of  the  rate  at  which  the  searcher  can  examine  new  area  on  the  tape  to  the  rate  uL  at 
which  new  tape  area  appears.  Let  w(t)  be  the  speed  of  the  searcher  with  respect  to  the  tape  at 
time  /.  Then  the  rate  at  which  the  searcher  examines  new  area  on  the  tape  certainly  does  not 
exceed  2 Rw(t).  If  the  searcher  is  at  the  same  position  on  his  path  at  time  T as  at  time  0,  and 
if  A  is  the  amount  of  new  area  examined  between  0  and  T,  then  necessarily 

(1)  A  <  2R  fT  w{t)dt. 

•f  o 
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Let  v  cos  #(/)  be  the  component  of  the  searcher’s  velocity  in  the  same  direction  as  the  tape  is 
moving.  Since  the  searcher  returns  to  his  initial  position  at  time  T \ 

J  cos  f)(r)dt  =  0. 

According  to  the  Law  of  Cosines, 

h  (r)  =  'J u2  +  v2  —  2 u  v  cos  H(t). 

The  fact  that 

w(t)  ^  \/w2  +  v2  —  u  — /■  V  =-cos  fl(r) 
v  u2  +  v2 


is  easy  to  show  by  observing  that  the  right-hand  side  is  positive  and  then  squaring  both  sides. 
Consequently, 

(2)  A  2RTy/u 2  +  v2. 


Since  the  amount  of  new  tape  that  appears  in  time  T  is  uLT \  this  leads  to  an  upper  bound  on 
the  probability  of  detection  p. 


(3) 


p  <  min 


1,  (2R/L)y/\  +  (v/w)2 


CONVEYOR  BELT  SEARCH 

In  order  to  achieve  the  upper  bound  (3),  it  is  evidently  necessary  that  cos0(r)  *=  0;  that 
is,  the  searcher’s  motion  must  always  be  perpendicular  to  the  sides  of  the  channel.  Unfor¬ 
tunately,  the  only  way  for  the  searcher  to  do  this  is  to  simply  move  back  and  forth  across  the 
channel,  and  this  type  of  motion  is  inefficient  for  fast  searchers  because  of  the  large  amount  of 
overlap  with  previously  searched  parts  of  the  tape.  The  situation  would  be  different  if  the  tape 
were  made  into  a  tube  by  fastening  the  edges  together,  in  which  case  the  searcher  could 
proceed  perpendicularly  to  the  target  tracks  without  ever  having  to  reverse  his  course.  The 
effect  of  this  would  be  the  same  as  if  there  were  a  conveyor  belt  of  searchers  spaced  L  apart 
moving  across  the  channel.  Relative  to  such  a  belt,  the  tracks  of  the  targets  would  differ  from 
being  normal  to  the  belt  by  an  angle  0  whose  tangent  is  v/u  (see  Figure  1).  From  Figure  1,  it 
is  apparent  that  a  fraction  2 R/ L  cos  0  of  all  such  tracks  would  encounter  a  searcher,  assuming 
that  tracks  are  equally  likely  to  penetrate  the  belt  anywhere,  and  also  assuming  2 R  <  L  cos  0. 
But  this  is  the  same  as  (3).  Thus,  a  conveyor  belt  (CB)  search  of  a  channel  produces  the  larg¬ 
est  possible  probability  of  detection.  In  most  circumstances,  however,  a  CB  search  is  not  feasi¬ 
ble. 

APPROXIMATELY  OPTIMAL  SEARCH  PATHS 

In  order  to  avoid  the  overlap  caused  by  the  back  and  forth  (BF)  search,  a  fast  searcher 
should  move  paratlel  to  the  channel  near  the  edges.  For  example,  he  might  choose  a  bow  tie 
(BT)  shaped  path  as  in  Figure  2.  In  [1],  the  choice  was  to  make  D  —  R  and  a  such  that 
v  sin  a  -  u.  The  probability  of  detection  was  compared  with  that  for  a  -  0  (which  corresponds 
to  BF),  and  BT  was  found  to  be  preferred  to  BF  for  fast  searchers.  Specifically,  letting  r  —  v/u 
and  A  —  L/2R  -  1,  the  results  were: 
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Prt  =  1  + 


P  RF  —  1  —  IX  — 


r?  +  i-i 


for  r  >  1 


/X(A  +  1), 


except  that  the  probability  cannot  exceed  1  in  either  case.  A  proof  of  (4)  and  (5)  is  available  in 
[3],  It  is  of  interest  to  compare  (4)  with  (3)  when  r  is  large  and  (5)  with  (3)  when  r  is  small. 
The  analytic  expression  in  (3)  is 

VTT7  _  n 


•  • 


in  the  present  notation.  Letting 


f\  =  v  1  +  rl 


/2=  1  + 


it  can  be  shown  that 


n  - 1\  - 


(1  +  r)  (r  +  V r2  —  1) 

from  which  it  follows  directly  that  fx  >  f2  and  lim,  _  „  (2r2)  (/,  -  f2)  =  1.  Thus,  PCB  >  /Vr, 
and  the  difference  becomes  very  small  for  large  r.  Also,  it  is  simply  a  matter  of  expanding  the 
square  in  (5)  to  show  that 


Pen  ~ 


Consequently,  PcB  ^  P BF’  and 
4\(\  +  1)  , 


'r2  +  1  -  1 


'a(x  +  I). 


P bf)  —  I' 


i.e.,  the  difference  between  the  two  becomes  very  small  for  small  r.  Figure  3  shows 
P CB  P BT.  and  PBF  as  a  function  of  r  for  A  =  2. 

The  fact  that  /^-b  ^  max  (PB F,  PBt )  does  not  follow  directly  from  the  fact  that  P CB  is  an 
upper  bound.  The  reason  for  this  is  that  corners  were  "squared  off'  in  [1]  for  ease  of  computa¬ 
tion,  which  has  the  effect  of  computing  too  large  a  detection  probability.  The  author  has 
repeated  these  calculations  with  two  changes: 

(1)  "round  corners"  were  used 


(2)  all  angles  a  such  that  0  <  v  sin  a  <  u  were  explored,  and  the  optimal  a 
selected  for  each  (k,r). 

The  computations  are  reported  in  detail  in  [4],  For  X  =  2,  the  optimized  detection  probability 
is  shown  in  Figure  3  as  P0P.  The  result  is  typical;  POP  is  very  nearly  PCB  except  when  PCB  is 
large.  Furthermore,  it  seems  clear  that  even  this  shortfall  could  be  eliminated  by  also  permit¬ 
ting  D  <  R  (see  Figure  2),  since  the  probability  of  detection  for  targets  near  the  channel  edges 
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r  =  v/u 

Figure  3  —  Comparisons  with  ihe  Conveyor  Belt  Bound 


will  be  small  as  long  as  D  =  R.  Erickson  [2]  has  done  some  computations  along  these  lines. 
Since  the  searcher  is  actually  allowed  to  use  any  closed  path  whatever,  rather  than  only  the  class 
of  generalized  bow  ties  that  we  have  permitted  him,  it  seems  reasonable  to  come  to  the  conclu¬ 
sion  that  there  exists  a  search  path  that  is  close  to  the  conveyor  belt  in  terms  of  probability  of 
detection,  provided  the  probability  of  detection  is  not  too  close  to  1.  It  follows  from  this  that 
formula  (3)  is  generally  a  better  assumption  than  either  (4)  or  (5),  for  reasons  of  both  simpli¬ 
city  and  accuracy.  The  geometric  simplicity  of  the  conveyor  belt  search  also  makes  various 
extensions  relatively  easy.  An  example  of  such  an  extension  is  given  in  the  next  section. 

THE  PROBABILITY  OF  CLOSURE 

Suppose  there  is  a  conveyor  belt  of  searchers  looking  for  targets  that  move  in  straight 
lines  normal  to  the  conveyor  belt.  Upon  detecting  a  target,  a  searcher  attempts  to  catch  (close) 
it  at  some  speed  v'  that  may  exceed  his  patrol  speed  v.  If  v'  >  w,  he  will  certainly  succeed,  but 
if  v'  <  w,  he  will  succeed  only  if  the  detection  is  made  in  a  certain  sector  of  the  detection  circle 
as  illustrated  in  Figure  4,  where  cos  0  *-  v'/w.  A  target  will  ultimately  be  caught  if  and  only  if  it 
touches  the  shaded  part  of  some  detection  circle.  As  in  Figure  1,  the  targets  arrive  relative  to 
the  conveyor  belt  at  angle  <j>.  The  geometry  of  computing  the  closure  probability  depends  on 
the  relative  magnitudes  of  d>  and  i /».  Figure  4  illustrates  the  case  where  d>  ^  i //,  and  Figure  5 
the  case  where  <t>  ^  Noting  that  cos(i h  -<{>)  +  cos(t /»  +  <t>)  =  2  cos  <t>  cos  i h,  we  see  that 


(6) 


(2 R/ L)  tan  <f> 


Pc  = 


(2 Rf  L )  cos  *li 


I  +  cos  (tj)  +  <f>) 
L  cos  <t> 


if  v  >  u 

if  v  <  u  and  <  i|» 
if  v  ^  u  and  >  i/». 
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Rcos  (^+<£) 


Figure  4  —  Closure  Geometry  when  <  ii 


Figure  5  —  Closure  Geometry  when  <t>  >  <l> 


Figure  6  shows  Pc  vs  w/v  when  v'  =  v  and  R/L  -  .1.  The  downward  jump  when  w/v  *»  1 
shows  that  there  is  a  definite  benefit  to  the  target  for  being  faster  than  the  searcher.  Of  course, 
increases  in  w  may  also  affect  R ,  particularly  if  the  detection  mechanism  is  based  on  sound 
waves,  so  it  does  not  follow  that  the  best  strategy  for  the  target  is  to  go  as  fast  as  possible. 
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ABSTRACT 

This  paper  studies  Lanehesier-type  combat  between  two  homogeneous 
forces  modeled  by  t he  so-called  general  linear  model  with  continuous 
replacements/withdrawals  It  demonstrates  that  this  model  can  be  transformed 
into  a  simpler  canonical  form,  which  is  also  shown  to  arise  from  fixed-force- 
levcl-breakpomt  battles  modeled  by  Lanchesier-type  equations  for  modern  war¬ 
fare.  Analytical  expressions  for  the  force  levels  for  the  general  variable- 
coefficient  linear  model  with  continuous  replacemenls/withdrawals  are  con¬ 
structed  out  of  so-called  general  Lanchester  functions  for  the  model  without 
replacements/withdrawals,  for  which  all  solutions  are  shown  to  be  nonoscillato- 
ry  in  the  strict  sense  These  force-level  results  are  unfortunately  so  complicat¬ 
ed  and  opaque  that  the  constant-coefficient  version  of  the  model  must  be  stu¬ 
died  before  any  insights  imo  the  dynamics  of  combat  may  be  analytically  ob¬ 
tained  Thus,  fairly  complete  results  arc  given  for  the  general  linear  model 
with  constant  aliriiion-ratc  coefficients  and  constant  rates  of 

rcplacement/withdrawal  However,  the  expressions  for  the  force  levels  arc  still 
so  complicated  that  we  have  not  been  able  to  develop  battle-outcome  prediction 
conditions  directly  from  them  alone  but  have  had  to  establish  general  results 
on  the  qualitative  behavior  of  solutions  A  significant  result  (and  one  that 
greatly  complicates  the  prediction  of  battle  outcome)  is  that  all  solutions  to  the 
model  wuh  replacemenls/withdrawals  are  no  longer  necessarily  nonoscillatory 
in  the  strict  sense  i  e  .  bolli  sides'  force  levels  can  lake  on  negative  values  if 
the  force-on-force  attrition  equations  are  not  "lurned  off  at  the  right  lime 
Thus.  this  paper  shows  that  the  addition  of  continuous 

replacemenls/withdrawals  10  a  Lanchesier-type  model  may  significantly  change 
the  qualitative  behavior  of  the  force-level  trajectories  Battle-outcome  predic¬ 
tion  conditions  are  nevertheless  given,  and  important  insights  into  the  dynam¬ 
ics  of  combat  are  briefly  indicated 


1.  INTRODUCTION 

Deterministic  Lanchesier-type  combat  models  (so-called  after  pioneering  work  by  F.  W. 
Lanchester  [12])  are  widely  used  today  in  both  the  United  States  [16],  [19],  [22]  and  also 
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(HQUSAF/SAZ)  and  partially  by  the  Office  of  Naval  Research  as  part  of  the  Foundation  Research  Program  at  the  Na¬ 
val  Postgraduate  School 


VOL  29,  NO  4,  DECF.MBF.R  1982 


617 


NAVAl  RESEARCH  LOGISTICS  QUARTERLY 


618 


J.  G.  TAYLOR 


NATO  countries  [8],  [91  for  defense-planning  purposes.  The  modern  high-speed,  large-scale 
computer  has  made  possible  the  development  and  use  [21  of  quite  complex  differential 
equation-based  operational  models  of  combat  operations  (e  g.,  see  [41).  It  is  indeed  surprising, 
therefore,  that  relatively  little  effort  has  been  devoted  to  the  mathematical  analysis  of  various 
Lanchester-type  paradigms  (see  [15,  p.  881)  from  which  these  complex  computer-based  combat 
models  have  been  developed  by  the  process  of  model  enrichment  (e  g.,  see  [31,  [131,  or  (16l). 
Accordingly,  this  paper  will  theoretically  investigate  the  qualitative  behavior  of  a  relatively  sim¬ 
ple  Lanchester-type  mode!  with  replacements  to  develop  some  important  insights  into  the 
dynamics  of  combat. 

How  do  replacements  and  withdrawals  affect  the  course  of  a  battle?  How  fast  should 
troops  be  pot  ashore  in  an  amphibious  landing  to  insure  its  success?  Will  reinforcements  reach 
NATO  in  time?  These  are  important  questions  both  for  the  military  tactician  and  also  the 
defense  planner.  In  this  paper  we  will  study  a  relatively  simple  Lanchester-type  model  that 
explicitly  considers  continuous  replacements  and/or  withdrawals  in  order  to  provide  some  basic 
insights  into  such  issues  as  delineated  above.  Our  basic  analysis  strategy  is  to  use  a  simplified 
auxiliary  model  for  understanding  the  basic  dynamics  and  benavior  of  a  large-scale  complex 
operational  model  [61,  [161.  This  has  been  the  basic  motivation  for  studying  the  relatively  sim¬ 
ple  Lanchester-type  paradigm  considered  here.  Also,  many  times  one  can  fit  such  a  simple 
analytical  model  to  data  generated  by  a  detailed  combat  simulation,  and  thus  a  simple  analytical 
model  such  as  we  consider  here  may  provide  an  economical  framework  for  summarizing  simu¬ 
lation  output  data  [5],  [10], 

Thus,  this  paper  theoretically  investigates  a  relatively  simple  Lanchester-type  model  that 
explicitly  considers  replacements  and/or  withdrawals.  Essentially,  no  analytical  work  (except 
for  the  rather  dated  work  of  Morse  and  Kimball  [14l  and  Karns  [111)  has  been  previously  done 
on  such  models.  In  particular,  no  previous  theoretical  work  has  been  reported  on  the  qualita¬ 
tive  behavior  of  solutions  to  such  models.  Our  new  theoretical  results  show  that  the  qualitative 
behavior  of  force-level  trajectories  is  fundamentally  changed  by  the  inclusion  of  continuous 
replacements  and/or  withdrawals  in  a  Lanchester-type  combat  model. 

2.  THE  GENERAL  LINEAR  MODEL  FOR  LANCHESTER-TYPE 
COMBAT  BETWEEN  TWO  HOMOGENEOUS  FORCES 

In  this  section  we  will  briefly  examine  the  general  linear-differential  equation  model  for 
combat  between  two  homogeneous  forces.  Special  cases  of  this  general  model  will  be  examined 
in  more  detail  in  subsequent  sections  of  this  paper. 

Accordingly,  we  consider  the  following  Lanchester-type  equations  for  xand  y  >  0 
dx 

—  -ait)y  -  pit)x  +  rit)  with  x(0)  -  •*■(), 

«'»  & 

dt  -  -bit)x  -  ait)y  +  sir)  with  .y(O)  =  y0, 

where  xit)  and  y(/)  denote  the  A' and  Y  force  levels  at  time  r,  and  air)  and  bit)  denote  Lan- 
chester  attrition-rate  coefficients,  which  represent  the  fire  effectiveness  of  a  single  "typical"  firer 
on  each  side.  There  are  several  different  sets  of  physical  and  operational  circumstances  to 
which  the  model  (2.1)  may  be  hypothesized  to  apply,  and  the  coefficients  ait),  pit),  rir),  and 
sir)  have  different  physical/operational  interpretations,  depending  upon  the  context  in  which 
the  model  is  viewed.  We  will  now  discuss  several  such  possibilities. 
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The  term  rit)  in  the  first  of  Equations  (2.1)  can  model  either  (1)  the  replacement  rate  for 
the  X  force  (with  a  negative  value  representing  a  net  continuous  withdrawal),  or  (II)  the  attri¬ 
tion  [in  which  case  r(f)<0]  of  the  X  force  from  exogenous  fires  (not  subject  to  attrition)  at  a 
rate  not  dependent  on  Xs  force  level.  Similar  remarks  apply  to  sit).  For  simplicity,  however, 
we  will  consider  only  the  first  possibility  here  and  will  consequently  refer  to  rit)  and  sir)  as 
replacement  rates.  Even  within  this  context,  two  different  tactical  situations  may  again  be 
hypothesized  to  yield  the  above  equations  (2.1)  (cf.  Figure  4  of  Taylor  [19]): 


either  (SI)  "aimed-fire"  combat  between  two  homogeneous  forces  with  "operational" 
losses  and  with  continuous  replacements, 

or  (S2)  "aimed-fire"  combat  between  two  homogeneous  primary  forces  (or 

infantries)  with  superimposed  effects  of  supporting  fires  not 
subject  to  attrition  and  with  continuous  replacements  for  the 
primary  forces  (see  Figure  1). 


Figure  I  "Aimed-fire"  combat  between  (wo  homogeneous  primary  forces  (infantries)  with  superimposed  effects  of  sup¬ 
porting  fires  (here,  from  artillery)  not  subject  to  attrition  and  with  continuous  replacements  for  the  primary  forces 


In  the  second  case  (S2),  it  is  assumed  that  each  side  uses  "aimed"  fire  and  that  target- 
acquisition  times  do  not  depend  on  the  number  of  enemy  targets.  The  supporting  weapons  are 
assumed  to  employ  "area"  fire  against  enemy  infantry  (see  Taylor  [19]  for  a  more  thorough  dis¬ 
cussion  of  modeling  assumptions;  also  [17],  [20]).  In  this  case,  determination  of  numerical 
values  for  the  attrition-rate  coefficients  air)  and  /3 (/),  modeling  the  effects  of  the  supporting 
fires,  follows  along  the  lines  discussed  in  Taylor  [19,  Section  6.3]  .  In  the  simplest  instance  we 
then  have  that,  for  example,  air)  =  aLv  Ay,  where  the  X  force's  artillery  is  denoted  as 
U.  aLv  denotes  the  lethal  area  of  a  single  U  artillery  round,  vv  denotes  the  U  firing  rale  per 
lube,  uq  denotes  the  U  force  level  (which  is  constant  because  it  is  assumed  that  the  U  force 
suffers  no  losses),  and  Ay  denotes  the  presented  area  of  the  region  occupied  by  the  K force. 

Mathematically,  we  make  the  following  assumptions  about  the  attrition-rate  coefficients 
and  replacement  rates  in  the  model  (2.1): 

(Al)  ait )  and  bit)  are  defined,  positive,  and  continuous  for  t  6  (/n.  +°°)  with 

ro  ^  0, 
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(A2)  ait)  and  pit)  ^  0  for  /  €  [r0,  +oo), 

(A3)  ail),  bit),  ait),  pit),  rit),  and  sit)  €  Lit0,  T)  for  any  finite  T  >  tn. 

Here  the  notation  ail)  €  L(t0,  T)  is  used  to  mean  that  f  a  it)dt  exists  (and  is  given  by  a 

J  'o 

finite  quantity).  Further  details  are  to  be  found  in  [17],  [20],  It  follows  from  (A3)  that  any 
force-level  trajectory  to  (2.1)  for  a  given  set  of  initial  conditions  is  continuous  and  unique.  We 
will  place  no  further  restrictions  on  the  replacement  rales  rit)  and  sir),  and,  consequently, 
negative  values  are  possible  for  them.  We  will  further  assume,  however,  that  air)  and  bit) 
are  given  in  the  form  [18!,  [201 

(2.2)  ait)  =  ktlnii)  and  bii)  =  kbhit), 

where  ka  and  kb  are  positive  constants  chosen  so  that  ail)/ bit)  =  kj  kh  when 
d{a  it)/ bir)}/dt  =  0.  In  other  words,  ka  and  kb  arc  basically  "scale  factors,"  which  are  useful 
for  parametric  study  of  battle  outcome  as  related  to  various  system  parameters  [18]  (see  also 
[201).  It  is  then  convenient  to  introduce  the  combat-intensity  parameter  k  t  and  the  relative-fire- 
effectiveness  parameter  defined  by 

(2.3)  A/  =  V ka kb  and  kR  =  — . 

Kb 

3.  TRANSFORMATION  TO  A  CANONICAL  FORM 

No  results  have  previously  appeared  in  the  literature  for  the  general  model  (2.1)  with 
variable  attrition-rate  coefficients.  We  will  now  show  that  (2.1)  may  be  transformed  into  a 
simpler  canonical  form  to  which  results  [16],  [18]  for  variable-coefficient  Lanchester-type  equa¬ 
tions  of  modern  warfare  (3.5)  may  be  applied.  Thus,  this  model  (3.5)  is  really  basic  for  study¬ 
ing  a  wide  variety  of  combat  situations  (cf.  Sections  5.4  and  5.5  of  Taylor  [16]).  The  substitu¬ 
tion 


pit)  -  .xit)  exp  j  I  pis)ds\,  qit)  =  yit)  exp  |  I  a (s )</$(, 


transform  (2  1)  into 
(  dp 


=  —  A  it)q  +  yit)  with  pi 0)  —  *<), 
dq 

(jt  =  —  B 1 1) p  +  fVit)  with  qi 0)  =  >'0, 


where 


Air)  =  alt)  exp  1 

1 

t 

It  I 

(3.3) 

Bit)  =  bit)  exp  • 

r 

l 

and 

i 

t» 

L 

(3.4) 

kit)  =  rit)  exp 

1  1 

i 

1. 

L 
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The  transformation  (3.1  >  is  motivated  b\  looking  for  an  "integrating  factor"  lor.  for  example, 
the  first  equation  of  (2.1).  as  vv  riling  </v/ dt  +  fiit)x  =  —  a  ( t  )>'  +  r  ( t )  suggests  to  one. 

As  we  have  seen  above  in  Section  2,  we  may  consider  liquations  (3  2)  to  model  "aimed 
fire"  combat  between  two  homogeneous  forces  with  continuous  replacements.  However,  there 
is  another  very  important  set  of  circumstances  that  leads  to  equations  of  this  form  Consider 
aimed-fire  combat  between  two  homogeneous  forces  modeled  by  Lanchester's  equations  lor 
modern  warfare  (for  vand  y  >  0) 


(3.5) 


~r~  =  —  ait)v  with  .v(0)  ~  -vo. 
dt 

dy 


=  —  b(t)x  with  r (0)  =  >'n. 

In  this  model  the  state  variables  x(t)  and  yit)  are  the  numbers  of  combatants  that  are  effective 
on  each  side.  Consider  now  a  fixed-force-ievel-breakpoint  battle  [16],  [19]  If  we  introduce 
new  state  variables  .VO)  and  )  0)  defined  by 

(3.6)  .VO)  =  xit)  —  Xjjp  and  K(/)  =  >  0)  —  y0P . 

where  xHP  and  vBP  denote  the  X  and  Y  force-level  breakpoints,  then 


(3.7) 


dX 


v0>  with  .V(0)  -  -v0  xhp- 


dY 

dl 


=  -bit)X  -  w ( r )  with  Y(0)  =  —  \ 


HP’ 


where  v ( r )  =  ait) yBf,  and  wit)  =  b(t)xHP.  These  equations  (3.7)  have  the  same  form  as 
(3.2),  and  thus  we  see  that  the  equations  (3.2)  may  also  be  taken  to  model  force  attrition 
"above  a  unit's  breakpoint."  We  observe  that  for  the  transformed  force-level  variable  .V  defined 
by  (3.6),  X  =  0  corresponds  to  the  .V  force  reaching  its  breakpoint. 

The  force-level  trajectories  .v</>  and  yit)  for  the  model  (2.1)  [equivalently,  (3.2)  or 
(3  7)],  however,  no  longer  possess  a  very  important  mathematical  property  that  is  possessed 
[18]  by  all  solutions  to  (3.5)  with  ait)  and  bit)  ^  0  for  all  t  >  0  and  ,v0  and  y0  >  0:  namely, 
all  solutions  to  (2.1)  are  no  longer  nonoscillatory  in  the  strict  sense  that  xit)  and  yit)  can  how 
have  more  than  one  zero  between  them.  Here  it  is  convenient  [and  indeed  usually  implicitly 
done  when  one  works  with  analytical  solutions  such  as  (6.1  1)  and  (6.1.2)  below]  to  relax  the 
nonnegativity  restrictions  on  the  force  levels.  The  possible  lack  ol  such  nonoscillatory  behavior 
will  be  shown  by  considering  a  specific  example  below.  This  mathematical  properly  is  trouble¬ 
some  and  makes  analysis  ol  battles  modeled  with  .2.1)  much  more  difficult  than  those  modeled 
with  (3.5) 

4.  RK. PRESENTATION  OF  FORCE  LEVELS 

The  solution  to  the  inhomogeneous  differential-equation  combat  model  (2.1)  may  be  con¬ 
structed  out  of  the  same  "building  blocks"  that  may  be  used  to  construct  the  solution  to  the 
corresponding  homogeneous  system  ol  linear  differential  equations.  Thus,  the  A  lorce  level  as 
a  function  of  time,  xit) .  for  the  general  model  (2.1)  may  be  represented  as 
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v(/)  =  exp  H  |8(s)</sJJ  x0|cy(0)C>(r)  -  SQ(0)SP(t)  -  y(n/k^{CpiO)Sp(i)  -  SP(0)CF(t) 

(4.1)  +  V*r  f'~li CP(s)Sp(r)  -  Sp(s)CP(i))ds 

z\s) 

where  Z(/)  =  — A(t)S(t )  +  dR/dr  -  \R(t)/ A  (r)}dA/dt  =  {-a(t)s(t)  +  dr/di  -  r(t){(  1/ 
a(r))da/di  -  a(r)lexp/,)/3(s)(/s},  and  the  hyperbolic-like  general  Lanchesier  functions  (GLF) 
Cp(i),  Sp(i),  C^O),  and  Sq(i)  arise  from  the  corresponding  homogeneous  system  of  linear 
differential  equations,  which  is  considered  in  the  next  section.  Thus,  they  hyperbolic-like  GLF 
Cp(t)  and  Sp(l)  are  linearly-independent  solutions  to  the  P  force-level  equation  (5.4)  that 
satisfy  the  initial  conditions  (5.5),  and  CQ(t)  and  Sq(i )  are  similarly  defined.  The  above  result 
(4.1)  is  readily  developed  by  considering  (3.1)  and  applying  well-known  results  for  inhomo¬ 
geneous  linear  ordinary  differential  equations  (e  g.,  see  Hildebrand  [7,  pp.  29-30))  combined 
with  those  for  the  model  (5.1)  (20). 

5.  THE  MODEL  WITHOUT  REPLACEMENTS  AND  WITHDRAWALS 
(COMBAT  WITH  SUPPORTING  FIRES) 

An  important  special  case  of  the  general  linear  combat  model  (2.1)  is  that  in  which  there 
are  no  replacements  and  withdrawals,  i.e.,  r(t)  and  sir)  =  0,  and  in  this  case  our  combat 
model  becomes 

a(t)y  -  (i(i)x  with  x(0)  =  *0. 
b(t)x  -  a(t)y  with  v(0)  =  y0. 


An  essential  characteristic  of  all  solutions  to  (5.1)  (if  we  forget  for  the  time  being  about 
the  nonnegativity  restrictions  on  x  and  y)  is  their  so-called  nonoscillation  (here  in  the  strict 
sense). 

THEOREM  1:  At  most,  one  of  the  two  force  levels  x(r)  and  y(i)  for  the  model  (5.1) 
can  ever  vanish  in  finite  time. 


PROOF:  Multiplying  the  first  of  Equations  (5.1)  by  v.  the  second  by  x,  and  adding;  we 
obtain  after  some  straightforward  algebraic  manipulations 


-~{xy)  +  {or  ( / )  +  jS(r)  +  2Vo1 
dt 


\xy  =  -|Va  (r)y  -  VMTT.vp. 


whence  integration  yields 


r)y(r)  =  exp|-J"n  ja(s)  +  0(.s)  +  2Va  (s)M.s))</.vj  xav0 

2  f 

-  -Ja(s)yis)  -  Jhls)x(s)  exp  J"o  ( a(r)  +  (3(r) 
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The  theorem  now  follows  by  observing  that  the  second  multiplicative  factor  on  the  right-hand 
side  of  (5.2)  determines  the  sign  of  x(t)yit)  and  that  this  factor  can  change  sign  at  most  once. 
The  simultaneous  vanishing  of  both  xir)  and  y<f)  is  precluded  by  the  fact  that  if 
x(ra)  =  y(ra)  =  0  for  some  finite  ta  >  0.  then  the  uniqueness  of  solutions  to  (5.1)  would  imply 
that  x(t)  =  y(t)  =  0.  c  n 


The  significance  of  Theorem  1  is  that  the  mathematical  behavior  of  the  force  levels  is  funda¬ 
mentally  different  for  the  models  (2.1)  and  (5.1):  we  show  in  Section  6.2  below  by  the  example 
of  the  constant-coefficient  version  of  (2.1)  that  (again  ignoring  the  nonnegativity  restriction  on 
the  force  levels)  both  x(i)  and  vit)  can  become  zero  (and  then  negative)  for  the  general  linear 
model  (2.1).  Thus,  the  addition  of  replacements  and/or  withdrawals  to  the  model  (5.1),  i.e., 
take  r(r)  or  s(r)  ^  0  in  (2.1),  leads  to  fundamentally  different  behavior  for  the  force-level  tra¬ 
jectories. 

Another  reason  for  briefly  considering  this  model  without  replacements  and  withdrawals 
(5.1)  is  that  the  same  basic  building  blocks  may  be  used  to  construct  force-level  solutions  to 
both  (2.1)  and  (5.1).  Thus,  the  A' force  level  as  a  function  of  time,  xit).  for  the  model  (5.1) 
may  be  written  as  [20] 


(5.3)  -ytr/k^CptOlVf)  ~  V0)Q(r))i . 


[x0{c^(0)C/>(/)  -  vo)V')l 


where  the  hyperbolic-like  GLF  Cpit)  and  SP(i)  are  linearly-independent  solutions  to  the  P 
force-level  equation 

d2P 
Jt2 


(5.4) 

with  initial  conditions 

(5.5) 


fiit)  -  a(t)  +  ~  -  ait)bit)p  =  0, 

air )  dt  clt 


<?(/„)  Jt 


Cpit,)  =  1, 

dCp 

-(/o>  =  0. 


Spit( ))  —  0, 

1 


a  ( t,)  Jt 


The  GLF  (’y(r)  and  S^(t)  are  similarly  defined.  It  should  be  noted  here  that  the  above  result 
(5.3)  is  a  special  case  of  (4.1)  Further  details  and  analysis  of  the  model  without  replacements 
and  withdrawals  (5  1)  may  be  found  in  [17]  and  [201. 


6.  THE  GENERAL  LINEAR  MODEL  WITH  CONSTANT  ATTRITION-RATE 
COEFFICIENTS  AND  REPLACEMENTS 


In  the  case  of  constant  attrition-rate  coefficients  and  replacements,  the  general  linear 


model  <2  1 

)  reads 

J\ 

Jf 

~  a  i 

-  /jv 

+  r  with 

v(0)  =  An 

(6.1) 

ih 

dt  = 

-  hx 

—  a  \ 

+-  v  with 

>(())  =  v(). 

•  • 


i 


1 


•  '• 


•  •  •  • 
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where  a,  b,  a.  (J,  r.  and  s  denote  quantities  that  remain  constant  during  a  particular  haitle.  and 
we  assume  that  a  and  b  >  0.  while  a  and  fi  ^  0.  Although  there  are  several  ditTerem  sets  ol 
physical  circumstances  that  may  be  hypothesized  to  yield  (6.1)  (see  Section  2  above),  we  will 
consider  (6.1)  to  model  "aimed-fire"  combat  between  two  homogeneous  forces  with  supporting 
fires  not  subject  to  attrition  and  continuous  replacements/withdrawals.  In  this  case  we  should 
consider  rand  s  to  be  replacement  rates,  with  a  negative  value  denoting  a  net  rate  of  withdrawal 
of  forces.  Accordingly,  we  will  place  no  restrictions  on  the  replacement  rates  rand  s.  i  e  .  rand 
rare  unrestricted  in  sign. 

The  model  (6.1)  is  of  interest  because  it  provides  insights  into  the  consequences  of  addi¬ 
tional  troops  (continuously)  committed  to  battle.  We  may  consider  a  term  like,  for  example,  r 
to  represent  the  rate  at  which  additional  X  forces  are  committed  to  battle.  Another  related 
interpretation  is  that  r  represents  the  net  rate  at  which  the  X  force  enters  the  fields  of  fire  of  the 
Y  force.  Such  interpretations  essentially  apply  to  small-unit  combat  in  the  fire  fights.  We  may 
also  (see  Section  2  above)  however,  consider  (6.1)  to  model  combat  with  operational  losses  amt 
continuous  replacements.  In  this  case  we  may  consider  (6.1)  to  apply  to  large-scale  combat  over  a 
sustained  period  of  time,  and  then  /-and  s  represent  the  rates  at  which  additional  resources  are 
committed  to  the  theater  of  operations  (see  Morse  and  Kimball  [14,  pp.  71-73]).  In  this  light, 
analysis  of  this  combat  model  will  provide  important  insights  into  the  nature  of  tradeoffs  among 
(1)  direct  combat  capability,  (2)  "build-up"  capability,  and  (3)  operational  losses.  In  terms  of 
the  NATO  scenario,  the  model  (6.1)  provides  rough  insights  into  the  structure  of  tradeoffs 
among  the  quality  of  weapon  systems,  the  quantity  of  weapon  systems,  and  the  "build-up"  rates 
at  which  new  systems  are  introduced  into  the  theater  of  operations. 

6.1.  Force-Level  Results 


Unlike  the  previous  variable-coefficient  versions  considered  above,  the  constant-coefficient 
model  (6  1)  yields  an  analytical  solution  that  is  simple  enough  to  provide  some  important 
insights  into  the  dynamics  of  combat  through  direct  analysis.  When  ab  ^  a/3,  the  X  and  ) 
force  levels  \  <  ? )  and  t  (?)  foi  the  model  (6.15  17  arc  given  by* 


(6.1.1) 


,v(?)  =  £  +  Ae'“' ,r"  + 


0  +  6 


Be-'^'TU. 


and 

(6.1.2) 


(?)  = 


o+J> 

a 


Ac"1  'r,;  +  Be 


where 

(6.1.3) 


■I  = 


_ ah 

2tt(u  4-  6) 


??  +J> 
~  b 


»  « 


I  • 


»  « 


•  '• 


•  -• 


•  • 
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vorruci  h.t'tk’  'tuu«nn:  pi  oIk  '  o  ••  -  -'ii.'.'mus  w  u  c  implu.nl  in  [14.  p  '?]  I  uri  he?  mure.  ;  h  l-  sokiimn  K :  .  i  \  i<  .1  - 1 ; . » vs :  i  K 
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(6.1.4) 

(6.1.5) 

(6.1.6) 


B 


ah 


2H(f)  +8) 


H  +  8 


a 

hr  —  [is 


,  as  —  ar 


I'll  -  V  +  (.Vo  -  T)) 
A  =  ab  -  otfi, 


H  =  Vc/A  +  82,  8  =  @  y  U  •  and  <r  = 


a  +  (3 


Let  us  also  note  the  following  identity 


(6  1.7) 


(i  +  8 


=  V*' 


t* 


+  1 


where 

(6.1.8) 


a 


R  =  -  and  .S'  = 
6 


The  parameters  R  and  S  have  interesting  military  interpretations,  and  the  interested  reader  can 
find  a  discussion  about  them  in  Taylor  and  Parry  [21,  p.  5271,  Taylor  [16,  p.  7891,  or  Taylor 
[17,  P.  3681. 


When  ah  =  afi.  the  X  and  Y  force  levels  x(t)  andyO)  for  the  model  (6.1}  are  given  by 


a-  ( / )  =  xlle~'a+l3)'  + 


ar  —  as 


a  +  (3 


(6.1  9) 
and 


(fir  +  as)  | 

a  x  o  -  ayn 

(a  +  fi)2 

a+B 

(1  -  e-,"+/3h}, 


>■  ( I  )=.VnC  — 


(6.1.10) 


(as  +  hr) 


(a  + 


h 

ar 

—  as 

a 

1  «  +/3  ) 

) 

b 

ax  a 

1 

a 

a 

In  this  latter  case.  i.e..  when  ah  =  afi.  the  constant-coefficient  combat  model  (6.1)  possesses 
the  state  equation 


(6  1.11)  A  (.v„  -  v)  =  0(vii  -  .»  )  +  (fis  -  hr )  i. 

which  yields  the  overall  casualty-exchange  ratio  is  constant,  i.e., 

T  _  B 


u>  I  13t 


v,  h  ' 

Valid  >  casualties  are  given  by 

\  —  V,,  E  rt  ■  v.  and  i,  =  i ,,  f  s r  —  )  . 
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Let  us  observe  that  in  all  cases  the  instantaneous  casualty-exchange  ratio  dx/dy  is  given  by 


(6.1.14) 


dx  _  /3 
dy  b 


+ 


B  A 

b  S~  Jy 


s  —  bx  —  av 


which  for  ab  =  a/3  becomes 

r  —  (/3/b)s 
s  —  bx  —  ay 

In  particular,  for  br  =  /3s  and  ab  =  a/3  we  have  the  linear  law 
(6.1.16)  b  (x0  -  x)  =  /3(  Vo  -  >’)• 


(6.1.15) 


dy  b 


Let  us  finally  sketch  the  development  of  the  above  expressions  for  the  force  levels  x(t) 
andy(t).  When  ab  5*  a/3,  we  may  write  (6.1)  as 


(6.1.17) 


dt  =  -a(y  -  Tj)  -  /3(x  -  £) 
dy 

dt  =  -6(.v  -  £)  -  a(y  -  q). 


whence  the  substitution 


(6.1.18)  T(r)  =  x(t)  -  £  and  Y(r)  =  y(r)  -  q 


transicrms  (6.1.17)  into 


(6.1.19) 


~  =  -aY  ~  (3X  with  X (0)  -  jc0  - 
dt 

dY 

dr  =  —bX  ~  aY  with  E(0)  =  y0  —  q, 


for  which  we  have  given  a  solution,  i.e.,  expressions  for  X(t)  and  K(/),  in  [20,  p. 
also  Bach,  Dolansky,  and  Stubbs  [1]).  When  ab  =  a/3,  we  may  write  (6.1)  as 


(6.1.20) 


=  r  -  /3 

,  a 

x  +  —  y 

dt 

b 

±.  , 

dt  =*  s  —  b 

x  +  -b-v\ 

with  x(0)  =  x0, 
with  y  (0)  =  y0, 


118)  (see 


whence  follows  (6.1.11).  The  stale  equation  (6.1.11)  may  then  be  used,  for  example,  to  elim¬ 
inate  y  from  the  first  of  Equations  (6.1.20),  and  hence  the  expression  (6.1.9)  for  x(i)  may  be 
obtained  by  a  simple  integration. 


6.2.  Theoretical  Analysis  of  Solution  Behavior 


Determination  of  the  qualitative  behavior,  e  g.,  battle-outcome  prediction  conditions,  for 
the  linear  combat  model  with  replacements  (6.1)  is  much  more  difficult  than  we  have  hereto¬ 
fore  encountered  because  the  force  levels  x(i)  and  y(r)  no  longer  possess  a  very  important 
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mathematical  property  that  facilitated  analysis  of  combat  modeled  with  Lanchester’s  equations 
for  modern  warfare  (3.5):  namely,  all  solutions  to  (6.1)  are  no  longer  nonoscillatory  in  the 
strict  sense  that  x(t)  and  y(r)  can  have  more  than  one  zero.  We  will  give  an  example  of  such 
solution  behavior  below.  However,  analysis  of  the  qualitative  behavior  of  the  model  (6.1)  is 
relatively  straightforward  when  ab  >  a/3,  i.e.,  the  intensity  of  combat  between  the  primary  sys¬ 
tems  exceeds  the  "intensity"  of  the  supporting  fires,  although  the  above  analytical  results  for 
x (t)  and  yU),  e.g.,  (6.1.1)  and  (6.1.2),  are  nevertheless  complicated  enough  so  that  direct 
analysis  of  them  has  not  proven  completely  fruitful.  We  will  now  develop  battle-outcome  predic¬ 
tion  conditions  for  a  fight-to-the-finish  for  this  case  in  which  ab  >  a/3. 

We  begin  with  a  few  heuristics  to  provide  some  intuitive  understanding  of  our  results. 
Let  us  first  observe  that  0  —  <r  >  0  if  and  only  if  ab  >  a/3.  Hence,  in  this  case  the  exponen¬ 
tial  eil>~'j)l  >n  (6.1.1)  and  (6.1.2)  is  a  strictly  increasing  function  that  grows  without  bound. 
Furthermore,  the  signs  of  x(t)  and  y(t)  for  large  t  are  opposite  and  determined  by  the  sign  of 
A.  However,  further  analysis  and  a  few  additional  assumptions  are  required  to  convert  these 
preliminary  heuristics  into  mathematically  precise  battle-outcome  prediction  results.  It  is 
worthwhile  to  finally  observe  that  then  A  =  0,  i,e.,  (x0  —  0  =  Cvo  —  tj)(0  +  6)/ 6,  (6.1.1)  and 
(6.1.2)  reduce  to 

x(t)  =  x0  +  f{l  -  e~(,>+a}'}, 

(6.2.1)  y(t)  =  y0  e'w+<7’'  +  tj{1  - 

Consequently,  one  must  have,  for  example,  ^  >  0  in  order  that  x(t)  >  0  for  all  finite  t  ^  0. 
Thus,  we  should  anticipate  having  to  make  some  sort  of  assumption  about  f  and  tj  in  order  to 
develop  battle-outcome  prediction  conditions. 

The  first  step  in  the  development  of  these  conditions  is  to  generalize  the  nonoscillation  of 
solutions  to  Lanchester-type  equations  for  modern  warfare  to  the  case  of  the  general  linear 
model  witn  constant  attrition-rate  coefficients  and  replacements  (6.1)  (cf.  Taylor  [18,  Theorem 
1]  or  Taylor  [20,  Result  4.1]). 

THEOREM  2:  Assume  that  ab  ^  a/3.  Then  for  all  finite  /  ^  0 

(6.2.2)  either  x(r)  >  £  ory(r)  >  17. 

If  either  of  conditions  (6.2.2)  is  violated  at  some  finite  time,  then  the  other  holds  for  all  time  1 
>  0. 


PROOF:  Recalling  that  the  substitution  (6.1.18)  transforms  (6.1.27)  into  (6.1.19)  ,  we 
observe  that  Equations  (6.1.19)  are  of  the  same  mathematical  form  as  those  for  the  model 
without  replacements  and  withdrawals  (5.1).  Consequently,  we  may  apply  Theorem  1  to 
(6.1.19)  and  conclude  that  XU)  and  Y(t)  between  them  have  at  most  one  zero  for  all  finite 
time  /  ^  0.  In  other  words,  either  X(t)  or  Y(t)  >  0  for  all  finite  t  ^  0,  i.e.,  by  (6.1.18)  we 
have  shown  that  (6.2.2)  holds  for  all  finite  t  ^  0.  Furthermore,  if  (for  example)  XU)  is  equal 
to  zero  at  some  finite  time,  we  know  from  the  proof  of  Theorem  1  that  Y(t)  >  0  for  all  t  >  0, 
whence  follows  the  last  part  of  the  theorem.  n  f  n 


•  • 


•  • 


I  • 


»  • 


We  can  now  develop  rather  precise  battle-outcome  prediction  conditions  if  we  assume  that 
both  £  and  tj  ^  0. 


I  • 
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THEOREM  3:  Assume  lhat  ab  >  afi  and  that  both  £  and  17  >  0.  Then  the  X  force  will 
lose  a  fight  to  the  finish  in  finite  time  (i.e.,  will  be  annihilated  first  in  finite  time)  if  and  only  if 


(6.2.3) 


(*o  ~  O  < 


9  +  5 
b 


(To  ~  v). 


which  may  also  be  written  in  the  equivalent  form 


(6.2.4) 


Oc0-  0  <  yfR1 


Cvo  -  ■*/)• 


Furthermore,  when  (6.2.4)  holds,  we  have  that  for  all  /  ^  0 

(6.2.5)  y(t)  >  -n  >  0. 

Neither  side  will  be  annihilated  in  finite  time  if  an  only  if 


(6.2.6) 


(*o  -  O  -  V* 


(To  “  vh 


and  in  this  case 

(6.2.7)  lim  x(r)  ==  £  ^  0  and  lim  y(t)  —  r>  >  0. 

/— +00  /— - f-00 


PROOF:  Sufficiency  of  (6.2.4)  for  the  annihilation  of  the  X  force  is  proven  by  observing 
that  9  -  cr  >  0  if  and  only  if  ab  >  a(3.  Consequently,  we  see  from  (6.1.1)  that  the  X  force 
level  x(r),  which  is  a  continuous  function  of  time,  becomes  negative  for  t  large  enough  when 
the  coefficient  A  in  (6.1.1)  is  negative,  i.e.,  (6.2.3)  holds.  Hence,  (6.2.3)  holding  [equivalently, 
(6.2.4)]  guarantees  that  the  X  force  will  be  annihilated  in  finite  time,  and  then  the  last  line  of 
Theorem  2  guaranteees  that  (6.2.5)  holds.  Necessity  of  (6.2.3)  for  the  annihilation  of  the  X 
force  (and  also  the  statements  concerning  the  nonannihilation  of  either  side)  is  proven  by 
observing  from  (6.1.1)  and  (6.1.2)  that  when  A  ®  0,  i.e.,  (6.2.6)  holds,  we  have  via  (6.2.1) 
that  x(t)  and  y(t)  >  0  for  all  finite  t  >  0.  This  result  shows  that  X  cannot  be  annihilated 
when  A  =  0  and  proves  the  nonannihilation  statements  involving  (6.2.6)  and  (6.2.7),  since 
(6.2.1)  is  readily  seen  to  yield  (6.2.7).  If  A  >  0,  then  (6.1.2)  shows  us  that  y(t)  has  a  finite 
zero  point,  since  9  +  8  >  0.  Consequently,  y(t)  can  be  made  to  be  negative  for  t  large 
enough,  and  Theorem  2  then  yields  that  *(/)>£>  0  for  all  finite  t  >  0.  Thus,  we  have 
shown  that  x(r)  can  vanish  only  if  A  <  0,  i.e.,  (6.2.4)  is  necessary  for  the  X  force  to  be  annihi¬ 
lated  in  finite  time.  _  _  ^ 


Still  assuming  that  both  f  and  73  >  0,  we  see  that  the  Y  force  will  be  annihilated  first  in 
finite  time  if  and  only  if  the  above  inequality  (6.2.4)  is  reversed.  Furthermore,  it  should  be 
clear  that  the  assumption  that  both  |  and  17  >  0  plays  an  essential  role  in  these  battle-outcome 
prediction  conditions.  For  example,  the  inequality  (6.2.4)  holding  for  the  initial  force  levels 
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guarantees  via  Theorem  2  that  y(t)  >0  for  all  t  >  0  only  when  17  >  0  as  the  numerical  exam¬ 
ple  depicted  in  Table  I  shows.*  In  this  numerical  example,  the  condition  (6.2.4)  is  satisfied  and 
we  consequently  still  have  that  for  all  t  ^  0 


(6.2.8)  >-(/)  >  7) , 

but  (6.2.8)  no  longer  guarantees  that  the  Y  force  will  never  be  annihilated  when  tj  <  0.  Thus, 
(6.2.4)  being  satisfied  no  longer  guarantees  that  Y  will  win  a  fight  to  the  finish  (i.e.,  a  battle 
that  lasts  until  one  side  or  the  other  has  been  annihilated)  when  17  <  0.  In  other  words, 
(6.2.4)  is  satisfied  for  the  particular  battle  depicted  in  Table  I,  but  the  Y  force  is  actually  annihi¬ 
lated  before  the  X  force  is.  Again,  the  reason  why  (6.2.4)  fails  to  correctly  predict  battle  out¬ 
come  is  that  t)  <  0.  This  numerical  example  should  alert  the  reader  to  the  fact  that  determina¬ 
tion  of  the  qualitative  behavior,  e.g.,  battle-outcome  prediction,  for  the  constant-coefficient 
model  with  a  constant  rate  of  replacement/withdrawal  is  in  some  sense  much  trickier  for  the 
variable-coefficient  model  with  no  replacements/withdrawals  (5.1). 

Moreover,  we  can  relax  somewhat  our  assumption  concerning  the  signs  of  f  and  tj  and 
still  develop  sufficient  condition  for  a  side  to  win  a  fight  to  the  finish. 

THEOREM  4:  Assume  that  ab  >  a/3  and  that  t)  >  0.  Then  the  X  force  will  lose  a  fight 
to  the  finish  (i.e.,  a  fight  that  lasts  until  the  annihilation  of  one  side  or  the  other)  in  finite  time 
if  (6.2.4)  holds.  We  also  have  for  this  battle  that  y(/)  >  17  >  0  for  all  t  ^  0. 

PROOF:  Very  similar  to  the  proof  of  Theorem  3,  since  A  <  0,  we  know  from  (6.1.1)  that 
x(f)  becomes  negative  "or  t  large  enough,  and  hence  x(i)  has  a  finite  zero  point.  Since  x(r) 
can  always  be  made  to  violate  (6.2.2)  by  taking  t  large  enough,  Theorem  2  tells  us  that 
y(r)  >  17  ^  0  for  all  /  >  0. 


However,  as  the  example  depicted  in  Table  I  shows,  "if-and-only-if”  statements  regarding  battle 
outcome  can  no  longer  be  made  when  we  assume  that  only  one  of  £  and  17  >  0  (in  the  numeri¬ 
cal  example  under  consideration,  only  £  >  0). 

7.  DISCUSSION 

The  results  that  we  have  given  in  this  paper  concerning  the  effects  of 
replacements/ withdrawals  on  battle  outcome  not  only  are  of  interest  in  their  own  right,  but  they 
are  also  important  for  understanding  the  behavior  of  complex  operational  Lanchester-type  com¬ 
bat  models  that  are  widely  used  today  in  both  the  United  States  (e  g.,  see  [2],  [4],  [19),  [22]) 
and  also  NATO  countries  [8],  [9).  In  particular,  a  simple  combat  model  such  as  we  have  con¬ 
sidered  here  may  yield  a  clearer  understanding  of  important  relations  that  are  difficult  to  per¬ 
ceive  in  such  a  complex  model.  Furthermore,  the  mathematical  combat  model  that  we  have 
studied  here  is  important  not  only  for  the  significance  of  particular  modeling  interpretations  but 
also  for  its  versatility  (see  Section  2  for  further  details).  In  particular,  its  interpretation  as 


’Mere,  we  have  computed  ,v(/)  and  y(t)  as  functions  of  time  from  (6.1.1)  and  (6.1.2)  and  have  ignored  the  nonne¬ 
gativity  restriction  on  them  when  the  differential  equations  (6,1)  are  viewed  as  a  Lanchester-type  combat  model.  In  ac¬ 
tual  application,  however,  one  would  terminate  the  battle  when  first  y  —  0.  Moreover,  in  practice  one  decides  to  do 
this  by  computing  the  force-level  trajectories  x(t)  and  y(l)  from  (6  11)  and  (6.1  2).  Thus,  one  is  interested  in  these 
formal  mathematical  solutions  without  consideration  of  the  physical  restriction  on  negative  force  levels. 
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TABLE  I  — Example  that 
Shows  that  One  Must  Have 
both  f  and  tj  ^  0  in  Order 
for  the  Inequality  (6.2.4) 
to  Correctly  Predict 
a  Y  Victory 

in  a  Fight-lo-the-Finish. 


1 

,v(r) 

y(t) 

0.00 

200.00 

60.00 

0.1 

172.26 

33.31 

0.2 

151.52 

13.73 

0.3 

135.94 

-0.56 

0.4 

124.17 

-10.92 

0.5 

115.19 

-18.33 

0.6 

108.25 

-23.53 

0.7 

102.79 

-27.07 

0.8 

98.40 

-29.35 

0.9 

94.76 

-30.65 

1.0 

91.64 

-31.18 

1.1 

88.85 

—31.11 

1.2 

86.27 

-30.53 

1.3 

83.78 

-29.53 

1.4 

81.30 

-28.15 

1.5 

78.76 

-26.43 

1.6 

76.10 

-24.37 

1.7 

73.27 

-21.99 

1.8 

70.23 

—  19.28 

1.9 

66.92 

-16.22 

2.0 

63.31 

-12.79 

2.1 

59.36 

-8.98 

2.2 

55.02 

-473 

2.3 

50.23 

-0.02 

2.4 

44.96 

5.19 

2.5 

39.15 

10.97 

2.6 

32.72 

17.36 

2.7 

25.63 

24.43 

2.8 

17.80 

32.35 

2.9 

9.15 

40.89 

3.0 

-0.41 

50.44 

3.1 

-10.98 

61.00 

3.2 

-22.66 

72.67 

3.3 

-35.56 

85.57 

3.4 

-49.82 

99.82 

3.5 

-65.57 

115.58 

NOTE:  In  this  battle  we  have 
taken  (in  compatible  units) 

a  ■■  6  -  2.  a=*/3-l,r-0. 
and  S  —  150.  It  follows  that 
(6.2.4)  is  satisfied  but  with  £  ” 
100  and  r)  —  -50. 
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representing  a  simple  model  of  large-scale  combat  over  a  sustained  period  of  time  with  replace¬ 
ments  continuously  committed  to  battle  generates  particular  interest  within  the  framework  of 
the  NATO  scenario  with  regard  to  rough  insights  into  the  structure  of  tradeoffs  among  the  qual¬ 
ity  of  weapon  systems,  the  quantity  of  weapon  systems,  and  the  "build-up"  rates  at  which  new 
systems  are  introduced  into  a  theater  of  operations. 

Unfortunately,  the  general  linear  model  with  replacements/withdrawals  (2.1)  is  much 
more  difficult  to  analyze  than  its  simpler  versions  (3.5)  and  (5.1)  previously  studied  (e.g.,  see 
[161  for  further  details  and  references;  also  [171,  (201).  We  show  how  the  general  linear  model 
(2.1)  could  be  transformed  to  a  simpler  canonical  form  (3.2),  but  unfortunately  this  canonical 
form  has  not  been  much  studied.  We  did  see,  however,  that  a  fixed-force-level-breakpoint  bat¬ 
tle  modeled  with  Lanchester-type  equations  for  modern  warfare  (3.5)  does  lead  to  this  same 
canonical  form.  More  importantly,  solutions  to  the  canonical  form  (3.2)  were  shown  to  exhibit 
qualitative  behavior  (at  least  in  special  cases)  markedly  different  from  that  for  Lanchester-type 
equations  of  modern  warfare  (3.5).  In  particular,  nonoscillation  in  the  strict  sense  was  shown 
not  to  hold  for  solutions  to  (at  least  the  constant-coefficient  version  of)  (2.1). 

We  showed  how  to  represent  the  force  levels  for  the  model  (2.1)  in  terms  of  general  Lan- 
chester  functions  corresponding  to  the  model  without  replacements/withdrawals,  but  these 
analytical  results  do  not  appear  to  be  of  much  practical  significance  for  developing  insights  into 
the  dynamics  of  combat.  More  importantly,  we  showed  in  Theorem  1  that  all  solutions  to  this 
model  without  replacements/withdrawals  had  the  property  of  being  nonoscillatory  (in  the  strict 
sense)  in  marked  contrast  to  that  of  those  to  the  general  model  (2.1)  (recall  the  example  dep¬ 
icted  in  Table  I).  This  analysis  of  solutions  to  (5.1)  turned  to  be  be  particularly  significant 
when  we  saw  that  it  could  be  applied  to  the  constant-coefficient  version  of  (2.1). 

In  hopes  of  developing  insights  into  the  dynamics  of  combat,  we  turned  to  the  general 
linear  model  with  constant  attrition-rate  coefficients  and  replacements  (6.1)  because  of  the 
apparent  opacity  of  analytical  results  for  its  variable-coefficient  version.  Unfortunately,  analyti¬ 
cal  results  for  the  force  levels  to  (6.1)  turned  out  to  be  so  complex  (although  wholly  and  expli¬ 
citly  expressible  in  terms  of  elementary  functions,  i.e.,  exponentials)  that  we  were  unable  to 
determine  the  qualitative  behavior  of  the  force-level  trajectories  by  direct  examination  of  these 
analytical  results.  However,  we  were  able  in  Theorem  2  to  generalize  the  nonoscillation  of 
solutions  to  this  model  by  transforming  its  equations  (6.1)  into  those  for  combat  with  only  sup¬ 
porting  fires  (5.1)  [recall  Equations  (6.1.19)1.  Theorem  2  was  particularly  significant  because 
we  were  able  to  develop  the  battle-outcome  prediction  conditions  of  Theorems  3  and  4  by  using 
it. 


The  complexity  of  solution  behavior  for  .he  constant-coefficient  model  (6.1)  was 
apparently  not  appreciated  in  the  earlier  work  of  Morse  and  Kimball  [141,  and  consequently 
incorrect  battle-outcome  prediction  conditions  were  implied  in  [14,  p.  72]  (recall  the  example 
depicted  in  Table  I).  Because  of  our  previous  theoretical  examinations  (particularly  with 
respect  to  the  nonoscillation  of  solutions)  of  the  simpler  models  (j.5)  and  (5.1),  we  had 
developed  enough  analytical  machinery  to  profitably  study  the  more  complex  model  at  hand 
(2.1)  [especially  its  constant-coefficient  version  (6.1)].  Such  results  were,  of  course,  not  avail¬ 
able  when  Morse  and  Kimball  wrote  their  pioneering  book  on  military  operations  research  [14]. 
Finally,  the  reader  should  note  that  we  have  shown  how  various  Lanchester-type  combat 
models  are  transformable  into  each  other  and  consequently  how  results  for  simpler  models  can 
be  adapted  to  the  study  of  more  complicated  ones. 
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8.  CONCLUSIONS 

(I)  Analytical  results  for  Lanchesler-type  models  with  continuous  replacements 
and/or  withdrawals  are  much  more  complicated  than  those  for  models  without 
replacements/withdrawals.  The  complexity  of  these  analytical  results  is  such  that 
general  results  concerning  the  qualitative  behavior  of  solutions  in  geneial  is 
required  for  the  study  of  theier  qualitative  behavior  in  specific  instances. 

(II)  The  problem  of  studying  a  fixed-force-level-breakpoint  battle  modeled  with 
Lanchester-type  equations  for  modern  warfare  is  mathematically  equivalent  to  that 
of  studying  a  fight  to  the  finish  with  continuous  withdrawals  on  both  sides. 

(Ill)  The  force-level  trajectories  for  the  so-called  general  linear  model  with  continuous 
replacements/withdrawals  behave  in  a  fundamentally  different  way  (at  least  in 
special  cases  such  as  constant  coefficients)  than  do  those  for  the  corresponding 
model  without  replacements/withdrawals  (i.e.,  so-called  combat  with  (only)  sup¬ 
porting  fires).  In  particular,  for  the  general  linear  model  with  continuous  replace¬ 
ments,  both  sides  may  successively  be  annihilated  in  battle  if  the  force-on-force 
attrition  equations  are  not  "turned  off'  at  the  right  time,  and  this  phenomenon 
makes  battle-outcome  prediction  particularly  difficult. 
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ABSTRACT 

The  power  of  the  component  randomization  lest  for  the  paired  sample  loca¬ 
tion  test  is  compared  to  the  power  of  the  parametric  test  for  paired  comparisons 
using  the  sum  of  differences  distribution.  This  analysis  is  presented  in  the  form 
of  operating  characteristic  curves  for  normal,  exponential,  uniform  and  abso¬ 
lute  lambda  observations  over  a  range  of  small  sample  sizes. 


1.  INTRODUCTION 

Little  study  of  the  power  of  component  randomization  tests  for  paired  comparisons  has 
been  undertaken  for  finite  sample  sizes.  Knowledge  of  the  power  is  important  for  evaluating 
the  tradeoffs  between  the  robustness  and  computational  ease  of  the  various  tests.  The  com¬ 
ponent  randomization  test,  the  most  powerful  distribution-free  test,  has  the  disadvantage  of 
requiring  great  computational  effort.  The  corresponding  parametric  test,  which  assumes 
knowledge  of  the  exact  form  of  the  distribution  of  the  test  statistic,  would  be  expected  to  have 
more  power  than  distribution-free  tests  which  do  not  make  use  of  this  information,  but 
knowledge  of  this  distribution  is  rare.  The  normal  lest  is  of  interest  because  it  is  widely  used 
and  uniformly  most  powerful  in  the  case  of  normally  distributed  observations. 

Quantifying  the  difference  in  power  between  the  first  two  tests  is  helpful  in  deciding 
which  test  to  use  in  a  particular  situation.  The  power  of  the  normal  test  under  the  null 
hypothesis,  which  is  the  probability  of  type  I  error,  is  important  in  determining  the  degree  of 
inappropriateness  of  the  normal  test  for  various  underlying  distributions. 

In  Deutsch  and  Schmeiser  (1],  (2],  methods  were  developed  to  study  the  component  ran¬ 
domization  test  and  the  exact  corresponding  parametric  test,  respectively.  The  methods  are 
general  in  that  they  are  appropriate  for  any  underlying  distribution  and  sample  size.  These 
techniques  are  implemented  here,  using  also  the  methods  of  quantile  estimation  discussed  in 
Deutsch  and  Schmeiser  [3]  and  of  process  generation  discussed  in  Schmeiser  and  Deutsch  [4], 
to  estimate  the  power  of  various  tests  of  location  for  the  paired  sample  design. 
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Specifically,  the  tests  considered  are  the  component  randomization  test,  the  corresponding 
parametric  lest,  and  the  commonly  used  normal  test  The  "corresponding  parametric  test"  is  to 
reject  H{)  if  the  test  statistic  is  greater  than  the  (l-«)  quantile  of  the  distribution  of  the  test 
statistic.  In  the  case  of  normally  distributed  obset cations,  the  parametric  test  and  the  normal 
test  are  one  and  the  same. 

In  the  next  section,  an  overview  of  the  analysis  performed  for  each  underlying  distribu¬ 
tion  and  sample  size  in  later  sections  is  described.  In  Section  3  an  analysis  of  the  power  of 
these  tests  under  several  specific  distributions  and  sample  sizes  is  presented.  Lastly,  in  Section 
4,  conclusions  of  the  analyses  are  delineated. 

2.  ANALYSIS  METHODOLOGY 

Before  looking  at  specific  results,  the  steps  necessary  to  determine  the  powers  of  the  tests 
of  interest  are  now  discussed.  Conceptually,  the  approach  is  straightforward,  requiring  only  two 
steps: 

(a)  Determine  F~](p ),  the  /rth  quantile  of  the  reference  distribution  F  for  the 
parametric  test.  The  power  of  the  parametric  test  or  the  normal  test  may  then  be 
determined  from  1  —  fS  =  1  —  F(Ca  —  yfn E{dja})  where  Ca  is  the  (1-q)  quan¬ 
tile  of  For  the  normal  distribution. 

(b)  Generate  N  samples  of  size  n  from  the  distribution  of  interest  with  specified 
expected  differences.  For  each  sample  perform  the  four  tests  and  update  a 
counter  for  each  test  which  rejects  Hn.  The  count  divideu  by  /V  is  then  the  esti¬ 
mate  of  the  power  of  the  test. 

Thus  the  power  of  the  parametric  tests  may  be  determined  from  either  phase.  Note  that 
the  power  of  the  component  randomization  test  may  be  determined  only  from  Phase  2. 

3.  THE  POWER  OF  CERTAIN  TESTS 

In  this  section  the  results  arising  from  the  techniques  developed  previously  are  presented. 
The  probability  of  type  II  error.  /3.  which  is  one  minus  the  power,  is  given  in  both  tabular  and 
graphical  form  for  each  of  the  tests  tor  various  underlying  distributions'  and  sample  sizes.  The 
aim  of  this  section  is  to  examine  some  specific  examples  to  determine  the  relationship  of  power 
to  other  factors  in  a  general  way.  The  normal,  exponential,  uniform,  and  absolute  lambda  cases 
are  examined.  These  distributions  were  selected  to  illustrate  the  effect  of  tail  weight  and  shape 
on  the  power  of  the  tests. 

In  each  section  results  arc  given  in  the  form  of  operating  characteristic  (O.C.)  curves. 
The  curves  for  sample  sizes  4,  7,  11,  and  15  arc  grouped  by  underlying  distribution  and  a 
value,  the  values  of  a  considered  tire  i/128,  1/16,  and  1/8.  the  closest  values  to  .01.  .05.  and 
.1  of  the  form  ml 2"  where  n  is  integer  These  values  are  necessary  since  <*  can  take  on  only 
values  of  this  form  for  permutation  tests. 

Normal  Observations 

In  this  section  the  power  of  the  tests  of  interest,  presented  in  the  form  of  O.C  curves,  are 
compared  for  the  case  of  normally  distributed  observations.  Since  the  appropriate  parametric 
test  is  the  normal  test,  only  the  component  randomization  and  normal  tests  are  of  interest  here. 
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Figure  1  shows  graphically  the  results  of  this  section,  which  are  also  presented  in  tabular 
torm  in  Table  1.  Several  observations  can  be  made  from  the  graphs  corresponding  to  a  = 
1/128,  1/16,  and  1/8,  respectively: 
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TABLE  1  —  Type  II  Errors  (fi)  of  Randomization  and  Parametric 
Test  (Standard  Deviations  Shown  in  Parentheses) 


1/128  =  .0078125 


7 

.991 

.991 

ii 

.991 

.991 

15 

.991 

.991 

a  = 

5 

.93(  0046) 

.71  (.0083) 

.38L0088) 

§  • 

5 

.86 

■4I 

.06 

5 

.  84  (.008 1 ) 

.35  (.0 1 5 ) 

.036(  0059) 

5 

.78 

.18 

.005 

5 

.76  (.017) 

.15(013) 

neg 

5 

.69 

.07 

neg  1 

i 

- 1 

»  • 

=  1/16  =  .0625z„  =  1.534 


.77(  0066) 

.50(  0079) 

.24(  0067) 

.70 

.32 

07 

.64L0088) 

.19(  0072) 

.02(0026) 

•  -• 

.58 

.13 

.01 

49(.01 1 ) 

.051  (.0070) 

neg 

.45 

.04 

neg 

.38(  020) 

.012(  0040) 

neg 

.35 

.01 

neg 

•  . 

a  -  1/8  -  . 


,075(.0042) 

.03 

.005  (.00 13) 
.002 
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1.  The  value  of  /3  decreases  as  sample  si/e  increases. 

2.  The  value  of  /3  decreases  as  E\dJ a)  increases. 

3.  The  value  of  ji  decreases  as  a  increases. 

4.  The  normal  test,  which  dominates  the  component  randomization  test  in  this  case 

of  normally  distributed  observations,  has  asymptotically  the  same  power  as  the 
component  randomization  test.  The  dominance  of  tl.w  normal  test  decreases  as  a 
increases  and  as  n  increases.  This  is  logical  since  many  observations  are  needed 
for  the  tail  areas  of  the  randomization  reference  distribution  to  be  well  developed. 
An  a  value  far  out  in  the  tail  requires  more  observations  for  the  same  power. 
Thus,  for  normally  distributed  observations,  the  largest  loss  of  power  in  using  the 
component  randomization  test  is  for  small  n  and  small  a. 

Exponential  Observations 

Results  are  given  here  for  exponentially  distributed  observations.  The  form  of  the  results 
is  the  same  as  for  the  normal  case  of  the  last  section,  except  here  the  parametric  test  is  distinct 
from  the  normal  test. 

Figure  2  shows  the  parametric  reference  distribution  for  the  parametric  test  under 
exponential  observations  as  determined  by  Phase  1.  The  upper  half  of  the  reference  distribu¬ 
tion  is  plotted  on  normal  probability  paper  for  sample  sizes  n  =  4,  7,  11,  15.  and  infinity.  The 
reference  distribution  as  n  approaches  infinity  is  the  normal  distribution  (the  straight  line  in  1  he 
figure)  with  the  difference  between  the  normal  distribution  and  the  parametric  reference  distri¬ 
bution  for  finite  sample  sizes  being  greatest  for  small  samples.  For  all  sample  sizes  the  normal 
distribution  lies  above  the  parametric  reference  distributions  above  approximately  the  95% 
quantile  and  below  the  parametric  reference  distribution  otherwise. 

Table  2  contains  the  results  of  this  section  in  tabular  form.  Three  tables  are  shown, 
corresponding  to  «  =  1/128.  1/16,  and  1/8.  respectively.  Each  of  the  entries  are  the  probabil¬ 
ity  of  type  II  error,  /3,  for  various  values  of  n  and  E{cija\.  Results  for  the  component  random¬ 
ization  test  are  in  the  columns  labeled  "R."  Similarly  the  parametric  test  results  are  under  "P" 
and  the  normal  lest  results  are  under  "N."  For  the  randomization  test  the  estimate  of  the  stan¬ 
dard  devia'ion  of  the  result  is  given  in  parentheses.  The  results  for  "P"  and  "N"  are  determinis¬ 
tic  calculations  from  the  results  of  Phase  1.  The  critical  values  used  for  ti  e  parametric  test  are 
determined  from  Figure  2  and  are  given  in  the  right-hand  column  under  C,  . 

The  most  surprising  aspect  of  these  results  is  that  the  power  of  t,.c  parametric  test  does 
not  dominate  the  power  of  the  component  randomization  test.  Especially  for  alternative 
hypotheses  close  to  //„,  the  component  randomization  test  has  more  power  f<  r  all  n  and  values 
of  o.  At  first  glance  this  is  counter-intuitive,  since  the  usual  circumstance  is  'hat  power  is  lost 
in  obtaining  distribution-free  properties.  Note  that  over  the  range  of  values  of  E[<l,/<r\,  how¬ 
ever,  the  parametric  test  is  indeed  more  powerful. 

An  intuitive  rationale  for  tire  greater  power  of  the  component  randomization  test  lot  alter¬ 
native  hypotheses  close  to  the  null  hypothesis  is  as  follows:  the  reference  distribution  for 
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Figure  2.  Reference  distributions  arising  from  exponential  observations. 
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1.  The  value  of  /3  decreases  as  sample  size  increases. 

2.  The  value  of  /3  decreases  as  E[dJ  <r}  increases. 

3.  The  value  of  / 3  decreases  as  a  increases. 

4.  The  normal  test,  which  dominates  the  component  randomization  test  in  this  case 
of  normally  distributed  observations,  has  asymptotically  the  same  power  as  the 
component  randomization  test.  The  dominance  of  the  normal  test  decreases  as  a 
increases  and  as  n  increases.  This  is  logical  since  many  observations  are  needed 
for  the  tail  areas  of  the  randomization  reference  distribution  to  be  well  developed. 
An  a  value  far  out  in  the  tail  requires  more  observations  for  the  same  power. 
Thus,  for  normally  distributed  observations,  the  largest  loss  of  power  in  using  the 
component  randomization  test  is  for  small  n  and  small  a. 

Exponential  Observations 

Results  are  given  here  for  exponentially  distributed  observations.  The  form  of  the  results 
is  the  same  as  for  the  normal  case  of  the  last  section,  except  here  the  parametric  test  is  distinct 
from  the  normal  test. 

Figure  2  shows  the  parametric  reference  distribution  for  the  parametric  test  under 
exponential  observations  as  determined  by  Phase  1.  The  upper  half  of  the  reference  distribu¬ 
tion  is  plotted  on  normal  probability  paper  for  sample  sizes  n  -  4,  7,  11,  15,  and  infinity.  The 
reference  distribution  as  n  approaches  infinity  is  the  normal  distribution  (the  straight  line  in  the 
figure)  with  the  difference  between  the  normal  distribution  and  the  parametric  reference  distri¬ 
bution  for  finite  sample  sizes  being  greatest  for  small  samples.  For  all  sample  sizes  the  normal 
distribution  lies  above  the  parametric  reference  distributions  above  approximately  the  95% 
quantile  and  below  the  parametric  reference  distribution  otherwise. 

Table  2  contains  the  results  of  this  section  in  tabular  form.  Three  tables  are  shown, 
corresponding  to  a  =  1/128,  1/16,  and  1/8,  respectively.  Each  of  the  entries  are  the  probabil¬ 
ity  of  type  II  error,  j3,  for  various  values  of  n  and  E[dil<r).  Results  for  the  component  random¬ 
ization  test  are  in  the  columns  labeled  "R."  Similarly  the  parametric  test  results  are  under  "P" 
and  the  normal  test  results  are  under  "N."  For  the  randomization  test  the  estimate  of  the  stan¬ 
dard  deviation  of  the  result  is  given  in  parentheses.  The  results  for  "P"  and  "N"  are  determinis¬ 
tic  calculations  from  the  results  of  Phase  1.  The  critical  values  used  for  the  parametric  test  are 
determined  from  Figure  2  and  are  given  in  the  right-hand  column  under  Ca. 

The  most  surprising  aspect  of  these  results  is  that  the  power  of  the  parametric  test  does 
not  dominate  the  power  of  the  component  randomization  test.  Especially  for  alternative 
hypotheses  close  to  //0,  the  component  randomization  test  has  more  power  for  all  n  and  values 
of  a.  At  first  glance  this  is  counter-intuitive,  since  the  usual  circumstance  is  that  power  is  lost 
in  obtaining  distribution-free  properties.  Note  that  over  the  range  of  values  of  E[d,/a-},  how¬ 
ever,  the  parametric  test  is  indeed  more  powerful. 

An  intuitive  rationale  for  the  greater  power  of  the  component  randomization  test  for  alter¬ 
native  hypotheses  close  to  the  null  hypothesis  is  as  follows:  the  reference  distribution  for 
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TABLE  2  —  Type  II  Errors  (p)  for  Randomization  (R),  Parametric  (P),  and 
Normal  (N)  Tests  Arising  from  Exponential  Observations 


X.Y  ~  Exp 

a  -  1/128  -  .0078125  -  2.417 


127 

128 

127 

128 

.990 

.86 

.89 

.86 

.59 

.48 

.40 

.34 

.07 

.06 

2.54 

(.0063) 

(.0090) 

(.0086) 

(.02) 

127 

128 

127 

128 

.990 

.76 

.80 

.79 

.30 

.20 

.18 

.07 

.008 

.007 

2.49 

(.013) 

(.0089) 

(.0046) 

(.01) 

127 

128 

127 

128 

.991 

.66 

.70 

.69 

.13 

.06 

.07 

neg 

neg 

neg 

2.47 

(.024) 

(.023) 

(.03) 

a  -  1/16  -  .0625z„  - 

1.534 

.9375 

.9375 

.940 

.67 

.70 

.69 

.40 

.30 

.31 

.21 

.06 

.07 

1.51 

(.0074) 

(.0077) 

(.0065) 

(.004) 

.9375 

.9375 

.939 

.54 

.61 

.61 

.17 

.12 

.13 

.035 

.01 

.01 

1.52 

(.0091) 

(.0069) 

(.0034) 

(.007) 

.9375 

.9375 

.938 

.42 

.45 

.45 

.0.68 

.04 

.04 

.005 

0005 

.0005 

1.53 

(.016) 

(.0046) 

(.0013) 

(.004) 

9375 

9375 

.938 

.34 

.34 

.34 

.01 

.01 

.01 

neg 

neg 

neg 

1.53 

(.019) 

(.007) 

(.02) 

a  -  1/8 

-  1.25z 

1  150 

.875 

.875 

.89 

.51 

.54 

.56 

.22 

.16 

.18 

.08 

.03 

.035 

1.09 

(.0079) 

(.0065) 

(.0043) 

(.004) 

.875 

.875 

.88 

.40 

.43 

.41 

.09 

.06 

.07 

.013 

.005 

.005 

1.11 

(.0090) 

(.0052) 

(.002) 

(.006) 

.875 

.875 

88 

.29 

.30 

.30 

.03 

.01 

.01 

.002 

neg 

neg 

1.13 

(.014) 

(.0031) 

(.0008) 

(.006) 

.875 

.875 

88 

.20 

.20 

.20 

neg 

neg 

neg 

neg 

neg 

neg  1 

1.14 

(.016) 

(.01) 

VOL  29,  NO.  4,  DECEMBER  1982 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


9  ..._.JT 


•  t 


642 


S  J.  DEUTSCH  AND  B.  W,  SCHME1SER 


exponential  observations  has  heavier  tails  than  the  normal  (a4  —  3.75  for  n  =  4,  and  the  nor¬ 
mal  value  is  a4  =  3)  while  the  randomization  reference  distribution  has  light  tails  (a4  <  3) 
since  its  range  is  always  finite.  A  distribution  with  a  high  value  of  a4  is  more  peaked  and  has 
heavier  tails  than  a  distribution  with  smaller  a4.  Small  a4  values  indicate  light  tails  and  heavy 
shoulders.  Now  an  alternative  hypothesis  close  to  the  null  hypothesis  is  more  easily  detected 
by  the  distribution  with  heavy  shoulders,  in  this  case  the  randomization  reference  distribution, 
since  many  observations  are  close  to  the  mean.  On  the  other  hand,  alternative  hypotheses  in 
the  tails  are  more  easily  detected  by  the  parametric  test  due  to  its  reference  distribution  having 
more  observations  in  the  tails. 

Note  that  the  power  of  the  normal  theory  test  is  very  similar  to  the  parametric  test. 
While  not  appropriate  for  exponential  observations,  the  nominal  value  of  a  is  not  far  from  the 
true  value,  as  indicated  in  the  columns  under  E[djcr)  —  0.  The  power  of  the  normal  test  is 
similar  to  the  power  of  the  parametric  test  for  all  alternative  hypotheses  and  sample  sizes, 
although  the  greatest  difference  is  for  small  sample  sizes  where  the  parametric  reference  distri¬ 
bution  differs  the  most  from  the  normal  distribution. 

Note  also  that  as  the  sample  size  increases  the  results  of  all  three  tests  converge  to  the 
same  values.  This  must  be  true  for  the  parametric  and  normal  tests  due  to  the  central  limit 
theorem.  This  is  equivalent  to  the  reference  distributions  becoming  the  same  in  Figure  2  for 
large  sample  sizes.  The  component  randomization  test  also  converges  to  the  same  power  for 
large  sample  sizes,  since  this  test  is  asymptotically  the  normal  theory  test  for  large  sample  sizes. 
However,  the  nonnormality  in  terms  of  sample  size  is  of  interest  here. 

Figure  3  shows  graphically  the  values  of  Table  2  for  the  parametric  and  component  ran¬ 
domization  tests.  The  normal  test  is  not  shown  since  it  is  only  approximately  correct  for  this 
case  and  since  the  results  are  so  similar  to  the  parametric  test.  All  of  the  same  general  relation¬ 
ships  between  a,  n,  and  Eidja-}  are  true  here  as  for  the  normal  case.  0  decreases  as  any  of 
the  three  factors  increase.  The  striking  difference  is  that  the  parametric  test  does  not  dominate 
the  component  randomization  test,  as  just  discussed. 

It  is  of  interest  to  compare  the  results  of  this  section  to  those  of  the  last  section  for  nor¬ 
mal  observations.  The  power  of  the  component  randomization  test  is  generally  greater  for 
exponential  observations  than  for  normal  observations,  the  greatest  difference  being  .1  for 
n  =  4,  a  —  1/16,  and  E{dl /<r)  -  .5.  However,  for  large  values  of  Eidja-)  and/or  for  large  n 
the  normal  observations  lead  to  the  greater  power.  In  these  cases  the  power  is  so  great  for  both 
tests,  however,  that  the  difference  is  not  usually  important.  Thus  it  appears  that  the  component 
randomization  test  actually  has  better  overall  power  for  exponential  observations  than  for  nor¬ 
mal  observations. 

This  relationship  does  not  hold  true  for  the  parametric  test.  For  small  a,  the  parametric 
test  performs  best  under  normal  observations.  For  a  -  1/16  the  test  has  similar  power  for  both 
types  of  observations  and  for  large  a  values  the  parametric  test  performs  best  for  exponential 
observations.  That  this  should  be  true  is  obvious  from  Figure  2.  The  reference  distributions 
all  cross  the  normal  distribution  around  the  .95  quantile.  Since  power  of  the  parametric  test 
depends  on  this  distribution  only  through  the  (1-a)  quantile,  power  should  indeed  be  similar 
for  a  —  1/16. 
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Note  that  the  result  of  all  the  parametric  reference  distributions  having  the  same  .95 
quantile  is  that  the  normal  theory  test  is  very  close  to  exactly  valid  for  exponential  observations 
for  any  sample  size  if  a  =  .05.  Smaller  or  larger  values  of  a  lead  to  inaccuracies  in  the  normal 
test. 

Uniform  Observations 


Results  corresponding  to  those  of  the  last  section  are  given  here  for  uniformly  distributed 
observations.  Figure  4  shows  the  reference  distributions  for  sample  sizes  n  =  4,  7,  11,  15,  and 
infinity  arising  from  uniform  observations.  Again  the  straight  line,  the  normal  distribution,  is 
the  limiting  distribution  as  n  approaches  infinity.  Not  shown  is  the  limiting  case  of  n  =  1,  the 
distribution  of  the  difference  between  two  uniformly  distributed  random  variables,  which  is  the 
triangular  distribution.  All  of  these  reference  distributions  have  a  finite  upper  bound  and  there¬ 
fore  lie  above  the  normal  line,  whereas  the  exponential  reference  distributions  were  below  the 
normal  distribution.  Note  that,  for  the  a  values  considered,  the  reference  distribution  is  essen¬ 
tially  identical  with  the  normal.  Only  for  a  greater  than  .01  is  the  difference  noticeable  in  Fig¬ 
ure  4  and  even  there  it  is  not  as  great  as  in  the  case  of  exponential  observations. 
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Table  3  shows  the  probability  of  type  II  error  for  various  sample  sizes  and  alternative 
hypotheses  E[djcr)  for  the  component  randomization  and  parametric  tests.  From  the  table  it 
would  appear  that  the  normal  test  works  well  in  this  case  as  an  approximation  to  the  true 
parametric  test,  even  for  small  sample  sizes. 


TABLE  3  —  Type  II  Errors  (f3)  for  Randomization  (R),  Parametric  (P), 
and  Normal  (N)  Tests  Arising  from  Uniform  Observations 
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The  results  for  uniform  observations  are  similar  to  those  of  the  prior  two  cases,  with  /3 
decreasing  as  a  function  of  n,  a,  and  E[dt/<r).  Again  of  interest  is  that  the  component  random¬ 
ization  test  has  greater  power  (lesser  fi)  for  E{dJ(r)  -  .5  and  a  —  1/16  and  a  —  1/8  than  the 
parametric  test.  That  is,  even  though  the  parametric  test  has  better  power  for  most  combina¬ 
tions  of  n ,  a,  and  £” { </,/ cr } ,  it  does  not  dominate  the  component  randomization  test. 
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It  is  apparent  by  comparing  Tables  1  and  2  that  the  normal  test  does  indeed  approximate 
the  properties  of  the  parametric  test  well  in  this  case.  The  power  never  differs  by  more  than 
.01  except  for  large  values  of  E{dJ<r)  where  the  difference  in  tail  weights  between  the  uniform 
and  exponential  distributions  has  an  effect.  If,  however,  a  value  of  .0001  and  a  small  sample 
size  were  used,  Figure  4  shows  that  the  normal  test  is  not  so  good  an  approximation  since 
£*9999,  equal  to  3.35  for  n  =*  4,  does  differ  from  the  normal  value  z  9999  =  3.45,  resulting  in  a 
biased  indication  of  the  true  a  value. 

Absolute  Lambda  Observations 

In  the  previous  three  subsections,  the  power  of  the  component  randomization  test  and  the 
corresponding  parametric  test  has  been  examined  for  normal,  exponential,  and  uniform  obser¬ 
vations.  These  three  distributions  were  selected  as  three  distributions  representing  a  wide  range 
of  tail  weights.  The  uniform  has  the  lightest  tails  (02  =  1.8),  the  normal  has  medium  tails 
(02  -  3),  and  the  exponential  distribution  has  heavy  tails  (02  =  9).  A  question  of  interest  is 
whether  tail  weight,  as  measured  by  the  fourth  standardized  moment,  is  really  the  central  factor 
in  determining  the  effect  of  underlying  distribution  on  the  power  of  the  tests. 

An  analysis  similar  to  the  last  three  sections  is  performed  here  to  gain  insight  into  the 
fourth  moment’s  impact  on  power.  In  particular,  observations  are  generated  from  the  absolute 
lambda  distribution  (Deutsch  and  Schmeiser  (4],  with  moments  one  through  four  identical  to 
the  standardized  normal  distribution.  Despite  having  the  same  skewness  and  kurtosis,  this  den¬ 
sity  function  has  a  shape  quite  unlike  the  normal. 

Figure  5  shows  the  reference  distribution  arising  from  the  aboslute  lambda  distribution 
with  parameters  Ai  —  0,  A2  “  —1.575,  A3  —  —.2247,  and  A4  =  .5  for  a  sample  size  of  four.  The 
reference  distributions  for  larger  sample  sizes  are  not  shown  since  in  the  figure  they  would  not 
be  distinguishable.  Also  included,  for  comparitive  purposes,  is  the  normal  distribution  which 
appears  as  a  straight  line.  While  the  difference  between  the  reference  distribution  shown  and 
the  normal  is  slight,  compared  to  previous  examples,  it  is  significant.  The  major  difference 
between  this  and  previous  examples  is  that  the  nonnormality  occurs  in  the  body,  rather  than 
the  tail,  of  the  distribution. 

Table  4  shows  the  results  of  the  Monte  Carlo  determination  of  the  power  of  the  com¬ 
ponent  randomization  and  the  parametric  tests  for  this  underlying  distribution.  The  results  for 
the  normal  test  and  the  parametric  test  are  identical  to  two  places  of  accuracy,  and  therefore  the 
results  are  given  only  for  the  parametric  test.  For  the  cases  studied,  the  power  of  the  test  does 
not  differ  noticeably  from  the  results  for  normally  distributed  observations.  Note,  however, 
that  the  power  of  the  component  randomization  us:  i«  somewhat  less  for  these  observations 
than  for  normal  observations.  The  discontinuity  of  ihe  underlying  distribution  seems  to  affect 
the  component  randomization  test  while  not  affecting  the  parametric  test. 

4.  CONCLUSIONS 

The  power  of  the  component  randomization  test  and  the  parametric  test  is  given  in  the 
form  of  operating  characteristic  curves  for  normal,  exponential,  uniform,  and  absolute  lambda 
observations.  The  power  of  the  component  randomization  test  is  usually  less  than  the  power  of 
the  appropriate  parametric  test,  but  is  not  dominated  for  all  alternative  hypotheses.  That  is,  in 
some  cases,  the  distribution-free  randomization  test  has  more  power  than  the  parametric  test 
which  assumes  knowledge  of  the  exact  distribution  of  the  observations,  including  variance. 
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TABLE  4  —  Type  11  Errors  (f3)  for  Randomization  (R),  Parametric  (P), 
and  Normal  (N)  Tests  Arising  from  Absolute  Lambda  Observations 
(0,  2.227,  -.224745,  .5) 


8/cr 

.5 

10  _ 

1.5 

n 

R  P 

R 

P 

R  P 

a  -  1/128  -  .0078125 

2.417 

7 

.89  .87 

.87 

.42 

.30  .06 

11 

.87  .78 

.36 

.19 

neg 

15 

.72  .68 

(.036) 

.14 

(.028) 

.08 

neg 

a  -  1/16  -  .0625 

za  -  1.534 

4 

.71  .67 

.68 

.32 

.18  .06 

7 

.66  .59 

.21 

.14 

.013  .007 

11 

.52  .47 

.04 

.03 

neg 

15 

.37  .34 

(.039) 

.013 

(.009) 

.013 

neg 

a  -  1/8-  .125 

za  —  1.150 

4 

.66  .58 

.29 

.20 

.06  .03 

7 

.49  .45 

.09 

.07 

.003  .001 

11 

.33  .32 

.01 

.01 

neg 

15 

.29  .23 

(.035) 

.006 

(.006) 

0 

neg 

The  robustness  of  the  normal  test  of  location,  a  well-known  property,  was  observed  in 
relation  to  the  performance  of  the  correct  parametric  test.  The  difference  in  reference  distribu¬ 
tions  is  seen  to  be  greatest  in  the  tails  of  the  distributions  for  common  distributions.  The 
power  of  the  component  randomization  test  is  different  for  normal  and  absolute  observations, 
even  though  both  have  identical  first  four  moments.  The  power  of  the  parametric  test,  on  the 
other  hand,  is  essentially  the  same  for  both  types  of  observations. 

Since  in  practice  the  appropriate  parametric  test  is  not  known,  the  component  randomiza¬ 
tion  test  is  a  viable  alternative  to  the  normal  test  for  samples  no  larger  than  25.  Especially 
when  the  variance  is  unknown  the  component  randomization  test  costs  little  in  terms  of  power 
loss. 
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ABSTRACT 

Preference  for  food  in  the  function  of  the  time  since  last  consumption  and 
total  preference  in  the  function  of  eating  frequency  are  equivalent  mathematical 
representations  of  attitudes  governing  food  consumption.  The  observed  and 
postulated  properties  of  these  functions  imply  a  formal  correspondence  between 
preference  maximizing  and  variety  seeking  behavior. 


INTRODUCTION 

Repetitive  consumption  is  defined  here  as  the  phenomenon  of  consuming  identical  goods 
repeatedly  over  time.  Repeated  demand  for  identical  goods  may  stem  from  technological 
necessity  such  as  replacement,  or  from  the  desire  to  obtain  the  sensory  stimulus  associated  with 
the  consumption  of  known  commodities.  This  article  is  concerned  with  the  latter  case. 

Food  consumption  is  the  most  prevalent  example  of  repetitive  consumption,  although  the 
same  phenomenon  may  be  manifested  in  "consuming"  music,  sights,  or  even  certain  fragrances. 
The  common  characteristic  of  these  phenomena  is  that  the  act  of  consumption  tends  to  create  a 
feeling  of  satiation  that  inhibits  the  desire  for  further  stimulus  of  the  same  kind  for  at  least  a 
short  time  period.  As  time  progresses,  however,  the  desire  may  build  up  again,  and  the 
consumption-satiation  sequence  is  repeated. 

The  mathematical  representation  of  this  process  is  possible  through  the  formulation  of  a 
preference  function  over  time  which  expresses  the  desire  for  consuming  a  particular  food  item 
during  the  consumption-satiation  cycle.  The  evidence  reported  by  Balintfy,  Duffy  and  Sinha  II] 
indeed  indicates  that  such  a  preference-time  function  exists.  Responses  to  questionnaires  indi¬ 
cated  that  preference,  i.e.,  the  indicator  of  anticipated  satisfaction,  increases  monotonically  as  a 
concave  function  of  time  since  last  consumption. 

Another  aspect  of  this  phenomenon  is  the  frequency  of  consumption  for  food  items  and 
people’s  ability  to  estimate  their  desired  frequency.  It  was  postulated  that  such  frequency  esti¬ 
mates  are  meaningful  only  if  they  are  predicated  upon  optimizing  behavior.  In  that  case  the 
desired  frequency  estimate  must  be  the  locus  of  the  maximum  of  total  preference  as  a  function 
of  eating  frequency.  Indeed,  when  school  lunches  were  planned  on  the  basis  of  a  quadratic 
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approximation  of  these  preference-frequency  functions,  double  blind  experiments  showed  a 
significant  increase  in  participation  and  food  consumption  [Balintfy,  Rumpt,  Sinha  2). 

The  purpose  of  this  article  is  to  show  that  the  preference-time  and  preference-frequency 
relations  are  equivalent  properties  of  repetitive  consumption  and  the  functional  forms  of  these 
relations  follow  a  well-defined  rule  of  correspondence.  Further  analysis  of  these  equivalence 
relations  appears  to  explain  variety  seeking  behavior  in  terms  of  mathematical  optimization 
models. 

The  equivalence  relations  identified  in  this  paper  provide  the  theoretical  foundation  for 
selecting  the  appropriate  utility  functions  and  coefficient  estimation  procedures  in  related  sur¬ 
veys.  These  utility  functions  are  very  important  in  optimizing  menu  planning  and  scheduling 
decisions  by  mathematical  programming  techniques  for  a  variety  of  institutional  food  service 
systems.  Experimental  evidence  indicates  that  preference  maximized  menus  do  increase  satis¬ 
faction  and/or  decrease  food  cost  significantly  in  practice  [3], 

DEFINITIONS. 

a.  fit)  is  the  preference-time  function  where  t  is  the  time  interval  at  which  a  unit 
portion  of  a  food  is  consumed. 

b.  -v  =  7V /  is  the  frequency  of  consumption  over  a  time  horizon  T. 

c.  #(x)  is  the  preference-frequency  function  representing  the  total  preference 
obtained  during  a  lime  period  T  by  consuming  a  unit  portion  of  an  item  x  times 
repetitively  at  t  -  T/x  times  apart  during  that  period. 

d.  gix)  =  xfit)  =  xfiT/x),  T  >  0. 


Graphs  of  the  functions  fit)  and  gix)  are  illustrated  below. 


Figure  t  Preference-Time  Function  Funmr  2.  Preference-Frequency  Function 
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PROPOSITION  1:  If  fit)  is  a  twice  differentiable  continuous  function  defined  for  t  ^  0, 
strictly  concave,  monotonically  increasing,  bounded  from  above  and  has  a  positive  root,  i.e., 
fito)  =  0  for  r0  >  0,  then 

(i)  lim  t/it)  =  0 

I  — ■  oo 

(ii)  The  function  gix)  =  xfiT/x)  with  T  >  0  is  strictly  concave  for  x  ^  0 

(iii)  The  function  gix)  obtains  its  maximum  at  a  unique  finite  point  x*  which  is  a 
solution  of  the  equation  fiT/x)  «=  iT/x)fiT/x). 

(See  Figures  1  and  2.) 

PROOF: 

(i)  Since  fit)  >  0,  it  is  clear  that  t  ■  fit)  >  0.  Let  us  assume  that  there  exist 
e  >  0  and  a  sequence  tn  — *  <»  for  which  t„fitn)  >  €.  We  may  assume  for  a 
subsequence  that  t„+l/ tn  >  2,  i.e.,  tj  2  >  r„_i-  Then/(r„)  >  e/tn  and  since  fit) 
is  a  decreasing  function,  it  is  implied  that  /(/)  >  e/ tn  for  t  <  t„.  Consequently, 
fitn)  _  /(/„/ 2)  >  i«/t„)  itj 2)  -  e/2.  Since  the  it„/2,  t„)  intervals  are  not  over¬ 
lapping,  fit„)  ^  £  { /(/ ,)  -  fit,/2)\  +  /(r,/2)  >  nie/2)  +  fitJD  which  con- 

i- 1 

tradicts  the  assumption  that  fit)  is  bounded. 

(ii)  We  need  only  to  prove  that  the  second  derivative  g’ix)  is  negative  for  x  >  0. 

Let 

( 1 )  gix)  -  fiT/x)  -  iT/x)  fiT/x) 

Then 

g“ix)  -  it1/ x3) fiT/x). 

Since  fit)  is  strictly  concave,  fiT/x)  <  0.  The  nonnegativity  assumption  on  x  implies 
that  iT2/x})  >  0,  which  preserves  the  inequality,  i.e.,  g"ix)  <  0  is  proved. 

(iii)  Since  gix)  is  strictly  concave,  its  unique  maximum  occurs  where  its  first  deriva¬ 
tive  vanishes,  i.e.,  gix)  -  0.  This  implies  by  (ii)  that:  fiT/x)  -  iT/x) fiT/x), 
or  by  substituting  t  -  iT/x), 

(2)  fit)  -  if  it). 

To  prove  that  (2)  has  a  positive  solution,  t*  -  iT/x*)  >  0,  implying  that  the 
corresponding  x*  is  finite,  let  us  consider  the  expression  hit)  -  fit)  -  tfit).  The 
function  hit)  is  <  0  for  /  <  r0,  since  fit0)  -  0,  and  positive  for  t  >  /*,  since 
t  fit)—¥  0.  By  continuity,  the  equation  ft(r)-0  must  have  a  solution  for 
t  “  t*  >  to-  At  that  point,  t*,  gix)  attains  its  maximum,  since  the  second  derivative 
g"ix)  is  strictly  negative  at  x*  —  T/t*\ 

g"ix*)  -  it  lx*')  fiT/x*)  -  it*3/  T)fit*)  <  0. 
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COROLLARY  1:  The  tangent  of  gix)  at  x  -  0  is  equal  to  the  upper  bound  of  fit),  i.e., 
g  (0)  —  lim  fit )  —  a.  Taking  the  limit  in  (1)  as  x  — 1 *  0,  and  taking  into  account  (i) 

f— oo 

g'iO)  =  lim  {/( T/x)  -  iT/x)fiT/x))  =  a 

x—0 

since 


lim  fiT/x)  =  lim  fit)  —  a  and  lim  iT/x)fiT/x)  =  lim  tfit)  -  0. 

x~  0  i— oo  x— *0  oo 


COROLLARY  2:  The  value  of  x  at  which  gix)  obtains  its  maximum  is  given  by 
x*  =  iT/t*)  where  t *  is  the  abscissa  of  the  point  where  the  line  from  the  origin  is  tangent  to 
fit).  By  (2),  fit)/t  =  fit)  at  t  =  /*,  i.e.,  the  mean  of  the  function  is  equal  to  its  derivative. 
This  is  also  the  point  where  fit)/ 1  reaches  its  maximum  and  the  cord  from  the  origin  becomes 
the  tangent. 

PROPOSITION  2:  If  gix)  is  a  continuous  twice  differentiable  concave  function,  with  a 
zero  root,  i.e.,  g(0)  —  0,  and  bounded  from  above,  then  fit),  defined  as  fit)  -  it/T)  giT/t) 
where  T  is  a  positive  constant,  t  >  0,  has  the  following  properties:  (i)  it  is  strictly  concave,  (ii) 
it  is  monotonically  increasing,  (iii)  it  is  bounded  from  above,  and  furthermore,  (iv)  if  gix)  has 
a  maximum  at  a  positive  finite  point,  then  fit)  has  a  positive  root. 

PROOF: 

(i)  In  order  to  show  the  concavity  of  fit),  it  is  sufficient  to  prove  that  fit)  <  0  for 
t  >  0 

fit)  «  (l/Dg(r/r)  -  (l/f)g'(77t) 
fit)  -  iT/t3)  giT/t). 

Strict  concavity  of  g(x)  implies  g"ix)  <  0  for  every  x.  For  t  >  0,  iT/t3)  >  0, 
preserving  the  inequality.  So  fit)  <  0  or  fit)  is  a  strictly  concave  function. 

(ii)  In  order  to  show  that  fit)  is  monotonically  increasing,  it  is  sufficient  to  prove  that 
fit)  >  0  for  t  >  0,  or  by  (3): 

(4)  giT/t)  >  iT/t)g'iT/t). 

Using  the  mean  value  theorem  for  the  points  iT/t)  and  0,  we  get: 
giT/t)  -  giO)  -  iT/t)g'iT/r) 

where 

(5)  0  <  iT/r)  <  iT/t). 

Since  it  is  given  that  g(0)  -  0,  (4)  becomes:  iT/ t)g'iT/r)  >  iT/t)g  iT/t),  or  since 
iT/t)  >  0,  then  g'iT/r)  >  giT/t)  which  is  always  true  for  a  strictly  concave  function 
because  of  (5). 

(iii)  Since  gix)  is  bounded  from  above,  and  g(0)  —  0,  there  always  exists  a  large  number, 
say  a,  such  that  gix)  ^  xa  for  x  >  0  substituting  x-  iT/t)  it  >  0),  we  get 
giT/t)  <  iT/  t)aot  fit)  -  it/T)giT/t)  ^  a  i.e.,  fit)  is  bounded  from  above. 
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(iv)  For  the  last  part,  we  have  only  to  prove  that  the  equation  /(r)  =  0,  or  by  definition  the 
equation  (t/  T)g(T/ 1)  =  0  has  a  solution  for  t  >  0.  That  can  be  done  only  if: 

(6)  g(T/t)  -  0  for  />  0 

However,  since  gU)  starts  from  0,  i.e.,  g( 0)  =  0,  and  obtains  its  maximum  at  a  posi¬ 
tive  finite  point,  being  always  strictly  concave  for  x  >  0,  there  will  exist  a  positive 
x  —  .v0  >  -v*  for  which  it  becomes  0  again.  In  other  words,  there  exists  a  r0  >  0  such 
that  (T/t0)  -  xo  >  0,  for  which  (6)  holds. 

Applying  the  above  properties  to  well-known  results  concerning  the  maximization  of  con¬ 
cave  functions,  the  following  two  corollaries  are  stated  without  proof: 

COROLLARY  3:  If  a  time  horizon  T  is  partitioned  into  i,  time  intervals,  such  that 

m 

£  t,  =■  T,  and  the  preference-time  function  /(/,)  for  a  given  item  is  a  concave  twice 

/-i 

differentiable  function,  then  optimal  repetitive  consumption  strategy  over  time  implies  that  all 
the  time  intervals  must  be  equal,  i.e.,  /,  -  /,  for  all  i  and  j.  This  is  to  say  that  at  optimum  the 
marginal  preferences  with  respect  to  time  must  be  equal,  and  that  is  equivalent  with  the  propo¬ 
sition  that  at  optimum  the  time  intervals  of  consuming  the  same  food  are  equal. 

COROLLARY  4:  Given  concave  unimodal  preference-frequency  g,(x,)  for  different  food 

ffl 

items  j  -  1,  2 . «,  and  positive  quantity  K  such  that  £  x}  -  K,  then  the  consumption  fre- 

,  j-  i 

quencies  of  the  foods,  Xj,  are  optimal  if  the  marginal  preference  with  respect  to  the  eating  fre¬ 
quency  is  equal  for  each  item,  provided  Xj  >  0.  This  solution  is  feasible  only  if  gj(xj)  -  K  has 
a  positive  root  for  all  j,  which  is  not  necessarily  assured. 

COROLLARY  5:  Optimal  consumption  frequencies  that  are  all  positive  and  all  integers 
always  exist  if  K  is  integer. 

PROOF:  The  corollary  implies  that  the  following  nonlinear  integer  programming  problem 
always  has  solution: 

gj(Xj) 

Xj  -  K 

xj  >  0 

Xj,  K  are  integers, 
the  auxiliary  variable  ytJ  where 

m 

Xj  -  £  iy,j,  j  -  1,  2 . n 

i-O 

y>j  -  1 

y,j  >  o 

assuming  that  is  bounded  by  the  intervals  0  <  xt  <  m 
for  all  j. 
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Similarly 

m 

gj(xj)  -  £  gJii)yu  . 
i- o 

Then  we  have  the  following  piecewise  linear  programming 
representation  of  PI: 

n  m 

P2  maximize  ^  gj(<b,j 

subject  to  £  £  iya  -  K 

j- 1  »-o 

X-V/y  ”1  j  -  1.  2,  ...  «. 

0 

The  above  linear  programming  problem  has  «  +  1  rows  and  consequently  an 
(«  +  1)  x  (n  +  1)  basis.  This  means  that  for  in  —  1)  variables,  n  —  1  convexity  constraints 
can  be  satisfied  if,  and  only  if,  ytJ  -  1  yielding  n  -  1  integer  frequencies.  The  remaining  two 
variables  are,  however,  on  adjacent  grid  points  of  the  same  item  due  to  the  concavity  of  gjixj) 
and  according  to  the  simplex  rule  of  optimization.  Let  these  points  be  i  and  /  +  1.  Then  the 
subproblem: 

fro  +  0  +  l)y1+1.y  +  N  -  K 

where  N  <  K,  and  both  N  and  K  are  integers,  can  have  positive  solution  if,  and  only  if,  either 
yu  or  yj+i  J  is  zero.  Consequently,  P2  must  have  a  degenerate  solution  which  assures  that  all 
the  frequencies  of  solving  PI  are  nonnegative  integers.  Thus,  the  optimum  consumption  fre¬ 
quencies  for  PI  can  always  be  determined  by  linear  programming. 

CONCLUSIONS 

The  empirical  evidence  concerning  the  shape  of  preference-time  and  preference-frequency 
functions  was  utilized  to  derive  results  to  show  that  these  functions  are  describing  equivalent 
mathematical  properties.  Consequently,  one  of  the  functions  can  always  be  derived  from  the 
other. 

This  finding  is  important  for  the  consistent  estimation  of  the  parameters  of  the  functions 
chosen  for  the  analytical  representation  of  the  phenomena  under  study.  If,  in  the  simplest 
case,  the  perference-frequency  function  is  approximated  by  a  quadratic  function,  then  the 
preference-time  function  will  be  hyperbolic.  In  this  case,  it  is  sufficient  to  estimate  the  upper 
bound  of  fit)  and  the  locus  of  the  maximum  of  gix),  i.e.,  one  parameter  from  each  function. 
For  more  complex  functions  the  precision  of  curve  fitting  can  be  improved  by  imposing  the 
equivalent  properties  of  both  functions  on  the  data  points  used. 

Another  important  result  is  the  theoretical  foundation  this  investigation  yields  for  the 
proper  formulation  of  objective  functions  for  planning  human  diets.  The  augmentation  of  PI 
or  P2  with  nutritional  and  other  desired  constraints  leads  to  the  formal  statement  of  preference 
maximizing  menu  planning  problems  further  described  in  Balintfy  and  Sinha  [4).  The  solution 
of  these  problems  has  a  great  importance  in  institutional  menu  planning  and  in  individual  diet 
planning. 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


VOL.  29,  NO.  4.  DECEMBER  1982 


EQUIVALENCE  RELATIONS  IN  REPETITIVE  CONSUMPTION 


657 


In  a  more  general  sense,  the  existence  of  preference-time  functions  explains  why  repeti¬ 
tive  consumption  may  yield  optimal  consumption  frequencies.  Vice  versa,  whenever  variety 
seeking  consumption  behavior  occurs,  the  presence  of  preference-time  functions  can  be  rightly 
assumed. 
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ABSTRACT 

This  paper  describes  the  background  of  the  Office  of  Management  Budget 
Circular  A-21,  'Principles  for  Determining  Costs  Applicable  to  Grants,  Con¬ 
tracts,  and  Other  Agreements  with  Educational  Institutions,'  that  describes  the 
requirement  for  effort  reporting.  A  sampling  procedure  is  proposed  as  an  alter¬ 
native  to  100%  reporting. 


1.  INTRODUCTION 

Following  the  March,  1979  publication  of  the  revised  Office  of  Management  Budget  Circu¬ 
lar  A-21  ’Principles  for  Determining  Costs  Applicable  to  Grants,  Contracts,  and  Other  Agree¬ 
ments  with  Educational  Institutions,”!  which  promulgates  the  cost  principles  to  be  used  in  the 
determination  of  cost  charged  to  government  sponsored  agreements,  there  was  an  undercurrent 
of  faculty  dissatisfaction  with  the  revised  rules.  Of  major  issue  was,  and  still  is,  the  require¬ 
ments  relating  to  reporting  of  faculty  effort  charged  directly  ot  indirectly  to  government  agree¬ 
ments. 

The  two  acceptable  methods  for  effort  reporting  contained  in  the  revised  A-21  are  con¬ 
tained  in  section  J.6  of  that  document  and  are  called  Monitored  Workload  and  Personal  Activity 
Reporting.  The  Monitored  Workload  method  can  be  used  for  faculty  and  professionals  only, 
whereas  the  Personal  Activity  Reporting  method  can  be  used  for  all  personnel. 

The  major  objection  raised  by  the  faculty  centered  on  the  requirement  (under  either  sys¬ 
tem)  to  account  for  100%  of  their  effort  applicable  to  each  sponsored  agreement,  each  indirect 
cost  category,  and  each  major  function  of  the  University.  This  level  of  accountability  was  con¬ 
sidered  burdensome,  arbitrary,  and  to  some  extent,  useless  work  because  of  the  interrelated 
nature  of  the  research,  instruction  and  other  activities  of  the  faculty.  The  results  of  such  effort 
reports,  though  fulffllng  the  documentation  requirements  established  by  the  regulation,  were 
known  to  be  arbitrary  and  therefore  inaccurate,  and  with  the  added  detail  terribly  expensive  for 
institutions  to  implement. 

That  the  assessment  of  direct  and  indirect  costs  is,  at  times,  highly  subjective  and  approxi¬ 
mate  was  also  recognized  by  the  government.  Section  J.6.b  of  OMB A-21  states: 


This  work  was  supported,  in  part,  under  Contract  N00014-7S-C-056!  with  the  Office  of  Naval  Research. 
tThese  cost  principles  were  revised  again  in  July  1982. 
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*...  it  is  recognized  that,  because  of  the  nature  of  the  work  involved  in  academic  institu¬ 
tions,  the  various  and  often  interrelated  activities  of  professorial  and  professional 
employees  frequently  cannot  be  measured  with  a  high  degree  of  precision,  that  reliance 
must  be  placed  on  reasonably  accurate  approximations  and  that  acceptance  of  a  degree 
of  tolerance  in  measurement  is  appropriate.* 

Because  of  dissatisfaction  with  the  new  effort  reporting  guidelines,  the  major  research 
institutions  formed  a  task  force  under  the  auspices  of  the  Association  of  American  Universities 
(AAU)  to  develop  alternative  methods  of  gathering  faculty  effort  without  the  burden  necessi¬ 
tated  by  the  new  regulations.  As  a  consequence  of  their  deliberations,  it  was  suggested  that  a 
method  of  statistical  sampling  be  used  to  determine  the  indirect  effort  performed  by  the  faculty. 
The  results  of  the  sample  could  then  be  used  in  determining  indirect  cost  rates. 

It  was  also  noted  that  too  often  the  approximations  made  in  effort  reporting  resulted  in 
very  poor  data.  To  improve  the  data,  the  faculty  would  have  to  be  apprised  of  what  specific 
costs  are  to  be  labeled  a  direct  or  indirect  cost.  By  cutting  the  scope  of  reporting,  a  sampling 
procedure  would  permit  increased  dialogue  between  the  administration  and  faculty  thereby  ena¬ 
bling  the  faculty  to  have  a  better  understanding  of  the  process. 

The  idea  of  statistical  sampling  was  explored  between  Stanford  University  and  the  Office 
of  Management  and  Budget.  The  OMB  felt  the  idea  had  possibilities  and  suggested  that  Stan¬ 
ford  submit  a  formal  proposal.  The  authors  carried  out  some  preliminary  work  to  describe  how 
such  a  system  would  work,  and  a  plan  was  presented  to  OMB  and  a  panel  of  Federal  statisti¬ 
cians.  After  their  review,  agreement  was  reached  to  proceed  with  a  more  detailed  study  to 
include  the  process,  the  sample  sizes  to  reach  a  90%  confidence  level  with  a  probable  5%  error 
and  the  statistical  formulas  to  produce  the  desired  data. 

The  plan  was  completed  in  January,  1981  and  submitted  and  accepted  by  OMB  which  had 
chosen  ten  universities  who  had  expressed  interest  in  using  the  statistical  method  as  test  of  the 
proposal.  Unfortunately,  of  those  institutions  who  submitted  their  proposals  based  on  the  sta¬ 
tistical  model,  the  implementing  agency  wanted  the  test  made  in  parallel  with  a  standard  J.6 
method  so  that  the  results  could  be  verified.  In  essence,  this  stringent  requirement  virtually 
terminated  further  consideration  by  the  institutions. 

However,  on  January  7,  1982,  after  several  months  of  added  effort  by  the  AAU  and  other 
higher  education  associations,  a  proposal  for  revision  of  section  J.6.  of  A-21  was  placed  in  the 
Federal  Register.  Included  in  the  proposed  revision  is  paragraph  J.6.b(2)  (c),  which  reads: 

"(c)  The  payroll  distribution  system  will  allow  confirmation  of  activity  allocable  to  each 
sponsored  agreement  and  each  category  of  activity  needed  to  identify  indirect  costs  and 
the  functions  to  which  they  are  applicable.  The  activities  chargeable  to  indirect  cost 
categories  or  the  major  functions  of  the  institution  for  employees  whose  salaries  must 
be  apportioned  (see  J.6.b(l)  (b)  above  )  if  not  initially  identified  as  separate  categories, 
may  be  subsequently  distributed  by  any  reasonable  method  mutually  agreed  to,  includ¬ 
ing,  but  not  limited  to,  suitably  conducted  surveys,  statistical  sampling  (emphasis  added) 
procedures,  or  the  application  of  negotiated  fixed  rates." 

It  is  assumed  that  the  public  comment  will  be  overwhelmingly  favorable  to  the  proposed 
revision,  which  makes  the  publication  of  the  method  developed  at  Stanford  by  the  authors 
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timely.  Use  of  the  method  would  allow  confirmation  of  salaries  directly  charged  to  sponsored 
agreements  by  confirming  notations  on  the  monthly  expenditure  reports  and  less  costly  (and 
less  disruptive  for  the  faculty),  gathering  of  data  necessary  for  indirect  costing  purposes.  What 
follows  is  the  proposal  that  was  submitted  to  OMB  as  a  statistical  sampling  model. 

2.  THE  SAMPLING  PLAN 

The  proposed  sampling  plan  is  designed  for  a  particular  university  for  a  particular  purpose. 
However,  the  method  can  be  extended  to  other  organizations  and  for  other  purposes. 

The  sample  is  designed  to  determine  the  amount  of  effort  expended  by  the  faculty  in 
indirect  activities  such  as  departmental  research,  university  and  student  service  administration. 
For  example,  if  the  indirect  activity  departmental  administration  is  of  interest,  then  we  need  to 
estimate  the  total  cost,  T ’  expended  by  the  faculty  engaged  in  that  activity 

T-  f\S\  +  •  •  ■  fNsN, 

where  f}  is  the  fraction  of  time  that  the  >th  faculty  members  devote  to  departmental  adminis¬ 
tration,  Sj  is  the  academic  year  salary  of  the  y'th  faculty  member  and  N  is  the  number  of 
relevant  faculty.  There  will  be  similar  totals  for  university  administration,  research  administra¬ 
tion,  etc. 

Because  faculty  differ  with  respect  to  their  activities,  we  use  a  stratified  sample  thereby 
reducing  the  sample  variance  of  our  estimates.  Two  methods  of  stratification  appear  natural  in 
a  university  context,  namely,  rank  and  disciplines. 

For  stratification  by  rank  four  strata  may  be  appropriate:  professors,  associate  professors, 
assistant  professors,  department  chairs  and  faculty  with  major  administrative  duties. 
Stratification  by  discipline  might  include  the  Schools  of  Medicine,  Engineering,  and  Education, 
the  Physical  Sciences,  Social  Sciences,  the  Humanities,  etc.  In  making  these  divisions  it  is 
important  to  emphasize  that  within  each  stratum  the  faculty  should  be  relatively  homogeneous. 

Definition  of  the  Universe 

The  universe  is  assumed  to  be  all  faculty  who  conduct  sponsored  activities  and  have  other 
duties  funded  from  nongovernment  sources,  including  the  departmental  operating  budget.  Also 
included  would  be  faculty  totally  funded  from  the  operating  budget  or  a  combination  of  operat¬ 
ing  budget  and  other  nongovernment  sources.  A  faculty  person  funded  100%  on  sponsored 
agreements  or  patient  care  would  be  excluded  from  the  universe.  Since  the  results  of  the  sam¬ 
ple  need  to  be  generalized  to  the  universe  it  represents,  the  sample  must  reflect  the  elements 
necessary  to  fulfill  the  objective  of  the  universe. 

The  Strata 

At  Stanford  University  strata  were  chosen  using  a  combination  of  both  the  rank  and  dis¬ 
cipline  criteria.  These  were: 

(1)  Medical  School  department  chairs. 

(2)  Other  department  chairs  and  laboratory  directors. 
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(3)  Full  professors  in  the  sciences. 

(4)  Full  professors  in  other  fields. 

(5)  Associate  professors. 

(6)  Assistant  professors. 


3.  DETAILS  OF  THE  SAMPLING  PLAN 

3.1  Preliminaries 

For^ach  stratum,  we  have  available  the  total  number,  Nh  of  faculty  members,  and  the 
average,  Yn  of  the  product  of  academic  year  salary  and  proportion  of  time  devoted  to  depart¬ 
mental  administration.  Consequently,  we  can  also  determine  the  sample  variance,  S,2,  of  the 
T-values.  That  is 

S,2  =  Z(Y-  Y:)2/(N:  -  1). 


Actually,  much  of  the  above  data  is  unknown  beforehand.  However,  some  preliminary 
estimates  are  essential  if  we  are  to  sample  with  a  specified  degree  of  accuracy.  One  procedure  is 
to  obtain  estimates  from  past  data,  or  from  a  sample  of  past  data.  Indeed,  an  examination  of 
past  data  may  also  lead  to  improved  stratification.  At  Stanford  an  examination  of  1979-1980 
data  was  used  to  estimate  the  needed  parameters  and  also  to  establish  the  use  of  the  strata. 

3.2  Choice  of  Sample  Sizes 

The  design  depends  on  choosing  an  overall  sample  size,  «,  and  an  allocation  into  the  vari¬ 
ous  strata.  The  total  sample  size  depends  on  the  confidence  level,  c,  and  a  given  relative  error, 
e.  The  higher  the  required  confidence,  the  larger  the  required  sample  size;  the  smaller  the  rela¬ 
tive  error,  or  equivalently,  the  greater  the  precision,  the  larger  the  required  sample  size.  The 
sample  size  also  depends  on  the  variances  in  each  stratum,  but  not  in  a  simple  way. 

The  determination  of  n  is  given  by 

m  -  W  r2 

"  "  w1  +  zn,s2  ~  wl  +  V 

where  T  -  ZN,Sn  V  -  ZN,S,2  and  W  depends  on  the  confidence  level,  c.  and  the  relative 
error  e: 

(2)  W  -  e(Total)/c*. 

where  the  total  is  the  estimated  true  total  cost  of  administration,  c*  “  <f>-1  ((1  +  c)/ 2),  and  4> 
is  the  standard  normal  cumulative  function. 


(3) 


Once  the  sample  size  is  chosen,  the  allocation  n,  to  stratum  /  is  made  in  the  proportion 
N.S. 


w,  “ 


I,V,S, 


n, 


n,  <  N.. 
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3.3  An  Example 

The  following  data  with  five  strata  are  fictitious. 


Stratum 

A 

Number  N, 

in  stratum 

B 

Variance  S2 
in  each 

stratum 

(in  millions) 

C 

Stratum 

standard 

deviation  S, 

(in  thousands) 
D 

H,S, 

(thousands) 

E 

N,S, 

(in  millions) 
G 

ZNjSj 

F 

20 

100.00 

10.0 

200 

.110 

2000 

50 

4.00 

2.0 

100 

.055 

200 

9.00 

3.0 

600 

.329 

1800 

6.25 

2.5 

625 

.342 

1562.5 

5 

300 

1.00 

1.0 

300 

.164 

3.0 

T=  1,825 

1.000 

V  =  5, 862.5 

If  we  wish  to  estimate  the  total  amount  spent  on  departmental  administration  to  within 
5%  of  the  true  total  with  90%  confidence,  and  if  we  estimate  the  true  total  to  be  more  than  $7 
million,  then  from  (2) 

(.05)  (7,000,000) 

1.645 

where  c*  —  1.645  is  obtained  from  the  tables  of  normal  distribution.  From  (1) 

(1.825.000)2 


W 


=  212,766, 


°  (212,766)2  +  (5,862,500,000) 
which  we  round  upward  to  n  =*  66. 


65.1, 


The  allocation  to  strata  is  now  obtained  from  the  proportions  in  column  F.  Thus,  stratum 
one  will  have  a  sample  size  66(.  11)  =  7.3.  The  results  for  the  five  strata  are 

7.3,  3.6,  21.7,  22.6,  10.8. 

Since  these  values  are  not  integers,  it  will  be  safest  to  round  up  to 

8,  4,  22,  23,  11, 

which  yields  a  total  sample  size  of  68. 

Because  Stanford  is  on  a  quarter  system,  the  sample  sizes  within  strata  are  divided  into 
thirds,  representing  the  three  quarters.  To  avoid  unequal  numbers  per  quarter,  and  to  simplify 
the  procedure,  we  round  upwards  so  that  the  samples  are  divisible  by  3,  namely, 

9,  6,  24,  24,  12. 


Thus,  for  example,  we  would  sample  four  per  quarter  in  the  fifth  stratum.  The  total  sample 
size  would  now  be  75. 
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Remark 

An  alternative  procedure,  if  a  sample  is  not  divisible  by  3,  is  to  randomize  the  remainder. 
For  example,  in  the  fifth  stratum,  take  three  faculty  per  quarter  and  assign  the  remaining  two  at 
random  to  two  quarters. 

4.  DERIVATIONS  AND  TECHNICAL  RESULTS 

We  now  provide  some  of  the  derivations  and  rationale  for  the  formulas.  Formulas  that 
are  standard  are  not  derived. 


Let  yljk  be  the  y  value  of  the  j th  faculty  member  in  stratum  /  for  quarter  (semester)  k. 
[Since  Stanford  is  on  a  quarter  system,  we  use  three  quarters  in  our  discussion.  However  the 
procedures  are  easily  modified  for  use  at  a  university  with  the  semester  system  ] 


A  sample  of  size  n  is  chosen  and  then  partitioned  in  accordance  with  the  allocation 


", 


NA 

T 


n. 


N». 


in  each  stratum,  where  we  assume  that  n,  is  rounded  upward  to  be  divisible  by  3  (or  2).  We 
then  measure  the  y-value  of  1/3  (or  1/2)  of  the  sample  in  each  stratum  each  quarter  (or  semes¬ 
ter). 


A  notation  is  needed  to  denote  measurements  taken  in  each  of  the  quarters,  and  we  use 
y„  and  y,  and  y,  to  denote  the  sums  of  y-values  in  quarters  1,  2,  3,  respectively.  That  is 

", 

y,  -  X  ytji. 

7-1 

2n j 

y,  -  X  yu2 
>-«/+ 1 

T,  *“  X  Ti/3> 

>-25(+l 

where  n,  =  nj 3. 

The  sampling  procedure  has  two-stages:  In  the  first  stage  we  sample  the  faculty  within 
each  stratum,  and  in  the  second  stage,  sample  by  quarters  (semesters).  The  quantity  to  be 
estimated  is 

*  *,  3 

(3)  Y  =  X  I  I  V* 

/-I  7-1  *-! 

where  g  is  the  number  of  strata.  The  estimate  used  is 

(4)  T  -  £  —  (y,  +  y,  +  y,), 

T  ", 

which  has  variance 

(5)  V(  V)  -  V(E(  Tlsample))  +  £(  F(  flsample)). 


t  • 


1 


»  • 


t  « 


I  4 


l 


I  t 


4- 
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Presumably,  F(£(  flsample))  is  large  compared  to  £(  V(  Y Isample) ). 

Consider  the  population  with  values 

Y IJ  "h  Yij  2  “h  ^3). 

The  total  Y *  is  just  T,  and  our  sampling  scheme  reduces  to  a  single-stage,  stratified  simple  ran¬ 
dom  sample.  Also, 

N.  "• 

^  =  1^1  y?j- 

,  n<  j- 1 


which  has  variance 


iV,2(S*)2 


1  - 


N, 


=  V{  Y*). 


With  optimal  allocation,  n*=  n(NjS*/T*),  where  T*  =  I.NaS*.  Then 

N2Sf  | .  "  I  _  T*N,S* 


V(  T*)  =  £ 


n,st 


1  - 


N, 


=  1 


1  S: 
\~n~ 


I  N,Sf 


n  _  J  _  n2_ 

n  'j  n 


V * 


where  F*  -  I  A,S*2.  Now,  5*  <  S„  so  that 


F(  r )  <  —  -  V. 
n 


Also,  because  the  expectation  is  linear, 

f  N-  "< 

F(£(  Tlsample))  =  ^  y* 

Consequently,  V(Y)  ^  F(f*),and 


T  ”, 


-  ,/( Y*). 


V(  Y)  =  F(T‘)  <  —  -  K 
n 

which  suggests  the  approximation 
T2 

1 - V  ~  V(Y). 

n 

For  relative  error  e  with  confidence  coefficient  c,  we  must  have 
£{Ktl  -  e)  <  Z  <  K(l+c)i  >  c. 

where  Z  is  normally  distributed  with  mean  Y  and  variance  F(  Y)  =  (T2f  n)  -  V.  This  requires 
that  the  condition 
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(7)  2<J>(  W)  =  24>  (eY/yfVW))  -  1  >  c 

be  satisfied.  But  (7)  is  equivalent  to 

eYiy/vTT)  >  <*r‘ 


where  4>  is  the  standard  normal  cumulative  distribution.  Consequently, 

„  t! 

V(Y)  -  - - V  <  W1, 

n 


from  which  we  obtain 


(8) 


^  T 2 

n  &  — ; - . 

W2  +  V 


This  formula  is  slightly  optimistic  because  it  allows  for  fractional  allocation  to  strata  and 
because  V(Y)  was  approximated.  Since  T  and  V  are  not  known  exactly,  further  caution  would 
be  used  in  picking  rt. 
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ABSTRACT 

This  paper  considers  an  inventory  system  in  which  demand  occurrences  ar¬ 
ise  according  to  a  stationary  Poisson  process,  demand  sizes  at  each  occurrence 
follow  a  logarithmic  distribution,  and  leadtimes  are  random  variables  with  the 
gamma  distribution.  Both  the  exact  and  approximate  distribution  for  leadtime 
demand  are  derived  and  computations  are  performed  which  compare  the  ap¬ 
proximation  to  the  exact  distribution.  The  results  have  application  to  both 
repairable  and  consumable  item  inventory  systems. 


I.  INTRODUCTION 

Many  of  the  inventory  models  which  are  used  in  practice  reply  upon  knowing  the  proba¬ 
bility  distribution  of  demand  over  a  leadtime.  The  common  assumption  is  that  this  distribution 
is  normal.  However,  in  certain  circumstances,  the  normality  assumption  may  in  inappropriate. 
The  purpose  of  this  paper  is  to  derive  the  exact  distribution  of  leadtime  demand  under  the  fol¬ 
lowing  assumptions:  customer  requisitions  occur  according  to  a  stationary  Poisson  process, 
requisition  sizes  follow  a  logarithmic  distribution  and  leadtime  is  a  random  variable  with  the 
gamma  distribution.  In  addition  to  deriving  the  exact  distribution  of  leadtime  demand,  we 
compare  our  results  to  actual  operational  data  and  discuss  a  variety  of  approximations. 

A  number  of  researchers  have  considered  the  problem  of  determining  inventory  operating 
policies  when  requisition  size  exceeds  one.  For  example,  Hausman  [6j  extends  Hadley  and 
Whitin’s  15]  heuristic  while  Archibald  and  Silver  [1]  derive  optimal  (s,S)  policies.  These  stu¬ 
dies  differ  from  ours  in  two  ways.  First,  in  every  case  leadtime  was  assumed  to  be  determinis¬ 
tic.  Second,  they  focus  primarily  on  describing  optimal  and  suboptimal  ordering  policies.  Our 
interest  is  in  a  detailed  examination  of  the  distribution  of  demand  over  leadtime. 
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2.  THE  LOGARITHMIC  DISTRIBUTION 

The  logarithmic  (or  log  series)  distribution  was  originally  derived  by  Fisher,  Corbet  and 
Williams  [4]  and  has  been  discussed  by  Sherbrooke  ill)  in  connection  with  inventory  problems. 
It  can  be  derived  as  a  limiting  case  of  the  negative  binomial  distribution  and  has  the  form 

( 1 )  /  (_y )  =  - r~ - —  for  x  =  1 ,  2,  . . .  for  0  <  0  <  1 . 

—.v  ln(l  —  0) 


A  maximum  likelihood  estimator  for  0  is  obtained  as  follows.  The  likelihood  function, 
L  (0),  is 

(2)  JL(0)  =  0"*[-/«(l  -  0)1~" 

L'(H)  =  nx  0'“-|[-ln(l  -  0)]-" 

+  0"*(- «)[-/«(!  -  0)]-"-'  •— ! — . 


The  maximum  likelihood  estimator,  0,  solves  C'(0)  =  0  or 

n  (0)"'_l  (— ln(l  -  0))-"-|{jf(-/w(l  -  0))— 0/(1  -  0)}  =  0 

which  gives 


A  —  S  - 

-(1  -  0 ) I n ( 1  -  0) 

•■'here  .v  is  the  observed  sample  mean.  (This  estimator  could  have  also  been  derived  by  the 
method  of  moments  as  suggested  by  Chakravarti,  Laha  and  Roy  [3].)  Note  that  this  is  a  tran¬ 
scendental  equation  in  0  which  can  be  solved  efficiently  by  interval  bisection  since  the  right 
hand  side  is  an  increasing  function  of  0.  Alternately,  one  can  use  a  recursive  scheme  as  fol¬ 
lows;  let  w  =  0/1  —  0)  then  (3)  is  equivalent  to  ,vln(l  +  <u)  =  w.  This  suggests  the  recursion 
co„+i  =  .vln(l  +w//).  As  an  example,  the  observed  sample  mean  of  the  data  represented  in 
Table  1  is  3.94.  Starling  with  cu0  —  -5  we  obtain 

ca,  =  1.5975 
(02  ~  3.76 
=  9.08328 


(02  =  9.1 1864 

which  results  in  0  =  <a7/(I  +  ca7)  =  .9012.  Using  this  value  in  (1)  we  can  compare  the 
observed  and  theoretical  frequencies  for  a  typical  item.  These  results,  which  are  reported  in 
Tabic  1  show  the  close  agreement  between  the  observed  and  predicted  cumulative  distribution 
functions  for  this  item. 


NAVAI.  RESEARCH  LOGISTICS  QUARTERLY 


VOL  29.  NO  4.  DEC  EMBER  1982 


LOGARITHMIC  POISSON  GAMMA  DISTRIBUTION 


669 


TABLE  1  —  Comparison  of  Observed  Frequencies  and  Those  Predicted 
by  Logarithmic  Distribution  for  a  Typical  EOQ  Type  Item 


X 

Number  of 
Observations 

Observed 

Frequency 

Theoretical 

Frequency 

Observed 

Cumulative 

Theoretical 

Cumulative 

1 

93 

.4247 

.3896 

.4247 

.3896 

2 

31 

.1416 

.1755 

.5663 

.5651 

3 

13 

.10594 

.1054 

.6257 

.6705 

4 

15 

.0685 

.0712 

.6942 

.7417 

5 

10 

.0457 

.0514 

.7399 

.7931 

6 

15 

.0685 

.0386 

.8084 

.8317 

7 

8 

.0365 

.0298 

.8449 

.8615 

8 

8 

.0365 

0.235 

.8814 

.8850 

9 

3 

.0137 

,0188 

.8951 

.9031 

10 

4 

.0183 

.0152 

.9134 

.9190 

11 

7 

.0320 

.0125 

.9454 

.9315 

12 

3 

.0137 

.0103 

.9591 

.9418 

13 

0 

.0000 

.0086 

.9591 

.9504 

14 

1 

.0046 

.0072 

.9637 

.9576 

15 

2 

.0091 

.0061 

.9728 

.9637 

16 

1 

.0046 

.0052 

.9774 

.9689 

17 

0 

.0000 

.0043 

.9774 

.9732 

18 

1 

.0046 

.0037 

.9820 

.9769 

19 

0 

.0000 

.0031 

.9820 

.9800 

20 

2 

.0091 

.0027 

.9911 

.9827 

25 

2 

.0091 

.0008 

1.0000 

.9915 

3.  THE  LPG  DISTRIBUTION 

Let  us  now  assume  that  requisitions  are  generated  by  a  Poisson  process  and  the  requisi¬ 
tion  size  has  a  logarithmic  distribution.  (That  is,  the  demand  process  is  a  compound  Poisson 
process  with  logarithmic  compounding  distribution.)  It  is  well  known  that  (see,  for  example, 
Quenouille  [10])  that  the  total  number  of  units  demanded  in  any  fixed  time  /,  say  Z(r),  has  the 
negative  binomial  distribution.  In  particular,  we  obtain 

(4)  P{Z(t)  -  x)  -  --fo*  *-(l  -  OV'0 *  for  x  =  0,  12,  . . . 

x!  Her) 

where  c  -  -\/ln(  1  -  9)  and  X  is  the  requisition  arrival  rate. 

Boswell  and  Patil  [2]  give  fifteen  different  derivations  of  the  negative  binomial  distribu¬ 
tion,  thus  accounting  for  its  power  in  describing  many  common  phenomena. 

Now  let  us  assume  that  the  procurement  leadtime,  r,  is  a  continuous  nonnegative  random 
variable  with  probability  density  g(r).  In  general,  the  number  of  units  demanded  during  a  ran¬ 
dom  leadtime  r  is  a  random  variable  with  probability  function  h(x)  given  by 

15)  h(x)  -  f  f(x  |r )  g(r)dT 

"0 
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where  /(x/t)  is  the  probability  function  of  the  number  of  units  demanded  in  a  time  r.  Under 
our  assumptions,  /(x/r)  has  the  negative  binomial  distribution.  Since  cr  is,  in  general,  not  an 
integer,  we  use  the  gamma  function  representation  for  the  factorials,  so  that 


(6) 


r (ct  +  x) 
f(cr) 


(1  —  0)fT  g{r)dT. 


Using  the  fact  that  Ha)  =  (»  —  l)T(a  -  1),  we  have 

m  nc n  £  W*S„. 

1  VCT'  j- o  *-i 

where  the  coefficients  Sxk  are  known  as  Stirling  numbers  of  the  first  kind  and  can  be  computed 
from  the  recursion 

(8)  Sxk  =  +  (x  -  1)  S'*-,.*, 

for  k  =  1 ,  2 . x  and  x  —  1 ,  2,  .... 

with  =  0  for  all  x. 

Furthermore,  from  the  definition  of  c, 

(9)  (1  —  0)CT  —  exp{cr  Inf  1  —  0))  =  e~kT , 
so  that  we  may  now  write 

(10)  /i(x)  =  £  c*S**  J"Q  rke~kT  g(r)dT. 

X'  k~  1 


We  now  specialize  to  the  case  where  g(r)  has  the  gamma  distribution  with  parameters  a 
and  0  so  that 


(II) 


g(r) 


e~0T  for  r  >  0. 


Since  leadtimes  must  be  nonnegative,  the  gamma  distribution  should  provide  sufficient  flexibil 
ity  to  model  leadtime  variability  in  many  operating  environments. 


Using  the  fact  that 

(12)  f  °°  rk+a~x  e-(k+0)T  =  T (k  +  a)/ (A.  +  0)k+a 

J  0 

and  that,  as  above. 


(13) 


T(k  +  a) 

r(a) 


k 


£  aJSkj> 
j- 1 


we  obtain  the  following  as  the  probability  function  for  the  number  of  units  demanded  in  a  lead- 
time: 


(14) 


Mx) 


(0/(A  +/3))a 

I  a 

0  0X  i 

\+0  x!  Ji 


A  +  0 


for  x  =  0 

for  x  -  1,  2,  3,  . . . 
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We  call  this  the  LPG  distribution  (for  Logarithmic-Poisson-Gamma).  Its  four  parameters 
are  a,  0,  9  and  A  (9  and  A  determine  c).  An  example  of  the  LPG  distribution  is  presented  in 
Table  2  for  a  =  1,0  =  1,0  =  8  and  A  =  1. 


TABLE  2  -  The  LPG 


Distribution  Parameters 
a  *  /.  0  =  1. 


9  =  .8.  A  -  / 


X 

h(x) 

H(x ) 

— 

.5000 

.5000 

1 

i 

.1243 

.6243 

2 

.0806 

.7049 

3 

.0589 

.7638 

4 

.0451 

.8089 

5 

.0355 

.8444 

6 

.0283 

.8727 

7 

.0228 

.8955 

8 

.0185 

.9140 

9 

.0151 

.9291 

10 

.0124 

.9415 

11 

.0101 

.9516 

12 

.0083 

.9599 

13 

.0069 

.9668 

14 

.0057 

.9725 

15 

.0047 

.9772 

16 

0039 

.9811 

17 

.0032 

.9843 

18 

.0027 

.9869 

19 

.0022 

.9892 

20 

.0018 

.9910 

4.  A  RECURSIVE  FORMULA  FOR  THE  DETERMINATION  OF  THE  LPG  DENSITY 


For  numerical  calculations,  we  found  the  following  recursion  to  be  useful.  Let  us  assume 
that  a  is  integer,  and  let  C\  -  Q3/(A  +  0))a  and  C2  =  c/( A  +0).  Further,  using  (13)  let  us 
define 


(15) 


'  x.k 


C, 


—  Ci  s 


2  Jx.k 


(a  +  k  -  D! 
(q  -  D! 


Hence,  h(x)  may  be  computed  as  a  sum  of  Txk: 

(16)  h(x)  -  £  TX'k. 

*- 1 


9  .*4 


9  9  .9 
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Note  that  Tx 0  —  0  since  Sx  0  =  0  and  that 

r  r  (g  +  x  —  1)! 

XJf  1  '  a:!  (a-D! 

(17)  -  (OCJxia  +  Jf  -  1)  Tx-i 

since  Sxx  -  1.  Using  (8),  we  may  now  write  Txk  in  terms  of  Tx_\  k  \  specifically,  we  obtain 

(18)  Tx.k  -  ^[C2(a  +  k  -  1  )Tx_l,k_l  +  (x  -  1)  Tx_ul. 

Thus,  h(x)  may  be  evaluated  using  only  Tx„\  k  terms.  This  provides  significant  reductions  in 
computer  memory  and  calculation  requirements  compared  to  a  direct  evaluation  of  (14)  for 
each  x 

5.  APPROXIMATIONS 

Many  inventory  models  require  computing  reorder  points  from  fractiles  of  the  leadtime 
demand  distribution.  Finding  exact  fractiles  of  the  LPG  distribution  might  be  too  demanding 
computationally  for  many  real  applications.  In  this  section  we  consider  an  approximation  which 
uses  a  scaled  version  of  the  Poisson  distribution  to  approximate  the  negative  binomial  distribu¬ 
tion. 


The  mean  and  variance  of  Z(r),  the  number  of  units  demanded  in  time  t,  are,  respec¬ 
tively, 

(19)  E(Z(t))  -  ct9/(l  -  9). 

(20)  VAR(Z(f))  *•  ct0/(l  —  0)2, 

which  gives  VAR(Z’(f))/£(Z(r))  —  (1  -  0)-1.  In  certain  circumstances,  one  may  have 
knowledge  of  the  variance  to  mean  ratio  of  the  demand  which  can  then  be  used  to  estimate  9 
directly. 

The  approximation  is  based  upon  replacing  the  negative  binomial  distribution  of  Z(r) 
with  a  scaled  Poisson  distribution.  Let  Tbe  a  Poisson  random  variable  with  parameter  /*/  and 
let  W  be  defined  by  W  —  kY  for  some  k  >  0.  We  may  think  of  IF  as  a  random  variable  which 
assumes  values  0,  k,  2k,  ...  and  whose  distribution  depends  upon  the  two  parameters  nt  and 
k. 

Since 

(21)  E{W)  —  kfit 

(22)  VAR(  IV)  —  k2fit 

we  have  VAR (W)/E{W)  —  k.  Thus,  we  set  k  —  (1  —  0)-1  to  achieve  the  same  variance  to 
mean  ratio.  Comparing  the  mean  and  variances  of  W and  Z(t)  we  see  that  ^  -  c9  (recall  that 
c  -  -x/ln(l  -  0)). 

Since  the  negative  binomial  distribution  is  defined  on  all  nonnegative  integers,  we  would 
like  the  approximation  to  be  defined  on  the  nonnegative  integers  as  well.  We  have  found  the 
following  procedure  works  well.  Assume  that  the  scaled  Poisson  probabilities  are  shifted  to 
k/ 2,  3 A/2,  5 A/2,  ...  so  that 
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P{W-  (n  +  1)  k/2) 


0,  1.  2. 


We  then  assume  that  the  cumulative  distribution  function  is  linear  between  nk/2  and 
(n  +  1 ) A-/2.  As  an  example,  suppose  9  =  .75,  /  *  1,  c  -  2  (that  is  A  —  2.77).  Then  p.  — 
1.5,  A  —  4  and 

P{W~2)  -e_M'-.2231 
P[W  -  6}  -  e'^'pt  -  .3347 
/>{«/_  10)  -  -2510 

/>(  ^  -  14}  -  3/3!  -  .1255 


The  comparison  of  the  exact  negative  binomial  probabilities  and  the  scaled  Poisson 
approximation  is  presented  in  Table  3  for  this  case. 

TABLE  3  —  Comparison  of  Negative  Binomial  and  Scaled  Poisson  Approximations 

2,  A  -  2. 


Negative  Bionomial 
Probabilities 


625 

938 


.1055 

.1055 


98 

89 

77 

66 

56 

46 

38 

31 


.0257 


Scaled  Poisson 
Cumul.  Probabilities 


Scaled  Poisson 
Probabilities 


Negative  Binomial 
Cumul.  Probabilities 


625 
.1563 
.2618 
.3673 
.4662 
.5552 
.6331 
.6998 
.7561 
.8030 
.8417 
.8729 
.8986 
.9134 
.9361 
.9495 
.9601 
.9685 
.9753 
.9806 
.9846 
.9879 
.9905 


We  now  obtain  an  approximation  to  the  LPG  distribution  by  averaging  the  scaled  Poisson 
approximation  of  the  negative  binomial  with  the  gamma  distribution  of  leadtime.  That  is, 

>  i  C°°  e-*J(p.T)xlk  f$aTa~xe~PT  , 

(24)  />(Z(r)  -  *)  =  j0  -u%),  ■  <•  rfa)  It- 


r  • 


r . 9 
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But  this  integral  is  exactly  a  Poisson  mixture  with  a  gamma  distribution  which  is  still 
another  way  that  the  negative  bionomial  distribution  can  be  derived  (see  Baswell  and  Patil  [2]). 
Hence,  the  approximation  for  the  LPG  distribution  is  a  scaled  version  of  the  negative  binomial 
distribution.  The  approximation  therefore  is: 


(25) 


P[Z(t) 


kx)  = 


(a  +  x  -  D! 
x\  (a  -  0! 


a 

d-  +fi 


M  +0 


for  x 


0,  1.  2 . 


Note  that  these  probabilities  are  defined  on  0,  k,  2k,  ....  As  with  the  scaled  Poisson  we 
suggest  shifting  these  probabilities  to  k/2,  3k/2,  ...  and  approximating  the  probability  func¬ 
tion  by  assuming  the  cumulative  distribution  function  is  linear  between  these  fractile  points. 
We  tested  a  variety  of  cases  and  found  the  fit  to  be  excellent,  especially  in  the  tails.  In  Table  4 
we  compare  the  exact  LPG  probabilities  for  the  parameter  set  considered  in  Table  2  with  the 
scaled  negative  binomial  approximation.  Note_that  since  9  =  .8,  we  have  k  —  5  and  the 
approximate  cumulative  probabilities  (labelled  H(x)  in  the  table)  are  defined  at  the  points  2.5, 
7.5,  12.5,  etc.  The  final  column  gives  the  approximate  cumulative  distribution  function  defined 
on  the  positive  integers  obtained  from  a  linear  interpolation  between  the  fractiles.  Notice  the 
close  agreement  between  the  exact  and  approximate  cumulative  probabilities  in  the  tail  of  the 
distribution. 


TABLE  4  —  The  Scaled  Negative  Binomial  Approximation 
to  the  LPG  Distribution 
(Parameters  are  the  same  a  those  of  Table  2) 


X 

Exact  Probabilities 

Approximate  Probabilities 

h(x) 

H(x) 

H(  x) 

h(x) 

Approximate 

Cumulative 

0 

.5000 

r  .5000 

.1905 

.1905 

1 

.1243 

.6243 

.1905 

.3810 

2 

.0806 

.7049 

.6667 

.1905 

.5715 

3 

.0589 

.7638 

.1174 

.6889 

4 

.0451 

.8089 

.0444 

.7333 

5 

.0355 

.8444 

.0444 

.7777 

6 

.0283 

.8727 

.0444 

.8221 

7 

.0228 

.8955 

.8889 

.0444 

.8665 

8 

.0185 

.9140 

.0296 

.8961 

9 

.0151 

.9291 

.0148 

.9109 

10 

.0124 

.9415 

.0148 

.9257 

11 

.0101 

.9516 

.0148 

.9405 

12 

.0083 

.9599 

.9630 

.0148 

.9553 

13 

.0069 

.9668 

.0099 

.9652 

14 

.0057 

.9725 

.0049 

.9701 

15 

.0047 

.9775 

.0049 

.9750 

16 

.0039 

9811 

.0049 

.9799 

17 

.0032 

.9843 

.9877 

.0049 

.9848 

18 

.0027 

.9869 

.0033 

.9881 

19 

.0022 

.9892 

.0016 

.9897 

20 

.0018 

.9910 

.0016 

.9913 
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6.  THE  FIRST  FOUR  MOMENTS  OF  THE  LPG  DISTRIBUTION 


Knowledge  of  the  moments  of  a  complex  distribution  can  be  utilized  in  a  variety  of  ways. 
The  moments  can  be  used  to  estimate  the  distribution  parameters  or  to  approximate  the  distri¬ 
bution  itself.  We  derive  the  first  four  central  moments  (moments  about  the  mean)  of  the  LPG 
distribution. 


The  distribution  of  Z(r),  the  number  of  units  demanded  in  time  t \  is  negative  binomial 
with  parameters  q  —  9.  p  —  I  -  9  and  n  -  ct.  From  Kendall  and  Stuart  [7],  the  first  four 
cumulants  of  the  negative  binomial  distribution  are  given  by 

Kx  —  nqlp,  K2  —  nq/p2,  K2  —  nq (1  +  q)/p 3  and  K4  -  nq (1  +  4 q  +  q2)/p*. 

The  first  three  cumulants  are  equal  to  the  first  three  central  moments,  respectively,  while  the 
fourth  central  moment,  fi4  is  given  by 

M4  —  K4  "V  IK2- 

Hence,  the  first  four  central  moments  (f.f.m.)  of  Z(/),  say  m,.  1  <  /  <  4,  are 


In  the  context  of  our  problem,  we  interpret  Fas  Z (t)  and  X as  r.  It  follows  that 
£(Z(r))  -  £[£|Z(t)|t]] 

-  £(ct0/(1  —  0)1 
(36)  -  caO/l 3(1  -  9). 
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(26) 

ft  i  —  ct9/(  1  —  9) 

(27) 

M2  -  ct9f(  1  -  9)2 

(28) 

M3  -  Ct9(  1  +0)/(l  -  0)3 

t 

(29) 

M4  -  ci9(  1  +  40  +  92  +  3cr0)/(l  -  0)4 

In  order  to  derive  the  f.f.m.  of  the  LPG  distribution,  we  use  the  following  relationships 
which  can  be  found  in  Parzen  [8],  p.  55:  Let  X  and  Y  be  two  (dependent)  random  variables. 

Then 

t 

(30) 

£(F)  -  E[EiY\X)] 

(31) 

VAR (Y)  -  £(VAR(F|T)]  +  VAR[£(F|AD] 

(32) 

M3(n  -  E\pli(Y\X)\  +  »3ie(y\x)] 

(33) 

li4(Y)  -  E\n4(Y\X)]  +  6£(VAR(F|J)1  •  VAR[£(  Fl*)] 

• 

+  M4tfO'U)l 

where 

(34) 

M3OO  -  £l(F  -  £(F))3] 

(35) 

h4(y)  -  £[(r-  £(r)]4] 

» 

•  it' 


t  -• 


9  . ft. 
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Similarly, 
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VAR(Z(r)  -  £[VAR(Z(r)|r]  -  VAR[£(Z(r) It)] 

-  E[cT0/d  -  0)2]  +  VAR[ct0/(1  -  0)] 

-  cad/ (fiil  -  0)2)  +  (cd)2  a/(fi2(  1  -  0)2) 

(37)  *  ca0/[fid  -  «)]2  {fi  +  cfl}. 


Following  the  same  kinds  of  arguments,  one  eventually  obtains 

(38)  M|(Z(r))  -  - -c a}B ^  (<32(1  +  «)  +  2c2fl2) 

(39)  m(Z(T))  -  l03(l  +  4#  +  ti2)  +  fi2c0(3a  +  1) 

+  6fic202a  +  c203(3a  +  6)}. 

These  results  can  be  used  to  derive  exact  expressions  for  the  skewness  and  kurtosis  of  the 
LPG  distribution.  In  particular  the  skewness,  say  SK ,  is  given  by 

SK  =Mj(Z(r))/[VAR(Z(r))P/2 

and  the  kurtosis  (KU)  is  given  by 

*£/  =  M4(Z(t))/[VAR(Z(t))]2. 


In  Table  5  we  present  examples  of  the  relationship  between  the  mean,  variance,  skewness 
and  kurtosis  for  a  variety  of  system  parameters.  We  can  see  that  each  parameter  has  a  different 
effect  on  the  four  measures  considered.  The  mean  and  the  variance  both  increase  with  A, a  and 
9  (although  at  different  rates)  and  decrease  with  fi.  The  skewness  appears  to  be  most  sensitive 
to  changes  in  a  and  fi  while  the  kurtosis  to  changes  in  alpha,  fi  and  \  and  is  relatively  insensi¬ 
tive  to  changes  in  0. 


TABLE  5  —  Means,  Variances,  Skewness,  and  Kurtosis  for 
Various  System  Parameters 


a 

J3 

0 

A 

Mean 

Var 

SK 

KU 

i 

1 

.8 

1 

2.49 

18.60 

1.78 

20.94 

i 

1 

8 

5 

12.43 

216.56 

1.38 

7771.43 

i 

1 

.8 

10 

24.85 

741.96 

1.57 

84723.23 

2 

1 

.8 

1 

4.97 

37.21 

1.26 

76.94 

5 

1 

.8 

1 

12.43 

93.02 

.79 

789.73 

10 

1 

.8 

1 

24.85 

186.04 

.56 

5393.6 

1 

2 

.8 

1 

1.24 

7.76 

2.77 

54.77 

1 

5 

.8 

1 

.50 

2.73 

5.01 

169.02 

1 

10 

.8 

1 

.25 

1.30 

7.53 

366.37 

1 

1 

.1 

1 

1.05 

2.28 

1.09 

3.34 

1 

I 

.3 

1 

1.20 

3.16 

1.18 

9.99 

1 

1 

.5 

1 

1.44 

4.97 

1.32 

20.35 

1 

1 

.9 

I 

3.91 

54.36 

2.15 

17.12 

1 

1 

.95 

1 

6.34 

167.07 

2.52 

16.85 

1 

1 

.99 

I 

21.50 

2611.9 

3.35 

22.89 

to¬ 


rn- 
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ABSTRACT 

Lei  X  denote  the  life  of  some  system.  We  define  the  observed  hazard  rate 
at  time  t,  call  it  R(t),  as  the  instantaneous  probability  (density)  of  failure  of  X 
at  time  t  given  survival  up  to  r  and  given  a  complete  description  of  the  system 
state  at  t.  We  conjecture  that  the  total  observed  hazard— namely, 
R(t)dt-  is  an  exponential  random  variable  with  mean  1  and  verify  this  for 
the  special  case  when  X  is  the  distribution  of  system  life  of  an  n  component 
system  having  an  arbitrary  monotone  structure  function. 


1.  INTRODUCTION 


Let  X  denote  the  survival  time  of  some  item  and  suppose  X  has  distribution  function  F 
and  density  /  Then,  the  hazard  rate  function  of  A'— call  it  X  (/)  —  is  defined  by 


kit) 


1  -  Fit )  ' 


As 


kit)dt  =  P{t  <  X  <  t  +  dt  |  X  >  t  }, 

we  can  interpret  A(/)  as  the  instantaneous  probability  (density)  that  an  item  of  age  t  will  fail. 


The  distribution  Fean  be  expressed  as 

1  -  Fit)  -  exp  (  -  Jo  A(s)rft), 


implying  that 

1  -  FiX)  -  exp  {-  f  A(s)ds}. 

•Z  o 


‘This  research  has  been  partially  supported  by  the  Office  of  Naval  Research  under  Contract  N00014-77-C-0299  and  the 
Air  Force  Office  of  Scientific  Research  (AFSC),  USAF,  under  Grant  AFOSR-77-32I3B  with  the  University  of  Califor¬ 
nia. 
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Now.  as  is  well-known,  1  -  /'(.V)  has  a  uniform  distribution  on  (0,1)  and  as  the  negative  loga¬ 
rithm  of  such  a  random  variable  has  an  exponential  distribution  with  mean  1,  it  follows  that 

J*  A  ( s )  rfs  —  Exponential  (1), 

or,  m  words,  the  total  hazard  experienced  by  the  item  is  exponentially  distributed  with  mean  1. 

In  the  above,  the  hazard  rate  at  time  /  was  defined  to  be  the  probability  (density)  of 
failure  at  z  given  survival  up  to  that  time.  Now,  however,  let  us  suppose  that  we  define  the 
observed  hazard  at  time  z — call  it  /?(/)  — to  be  a  random  variable  which  represents  the  actual 
probability  (density)  of  death  at  time  /  given  not  only  the  fact  of  survival  up  to  time  /  but  also  a 
complete  description  of  the  "state"  of  the  item  at  that  time.  (Such  a  quantity  would,  in  general, 
be  a  random  variable  as  it  would  be  a  function  of  the  "state"  of  the  item  at  time  /  and  the  state 
would  itself  generally  be  a  random  variable.)  For  instance,  for  a  given  individual,  R(t)  would 
denote  the  probability  (density)  of  failure  at  time  /  given  the  life  history  of  the  individual  up  to 
time  /. 

f  x 

We  conjecture  that  }  R(i)dt  is  also  an  exponentially  distributed  random  variable  and  in 

•J  0 

the  following  section,  we  verify  this  conjecture  in  the  case  of  an  n  component  coherent  system 
in  which  components  function  for  a  random  time  and  then  fail. 

2.  COHERENT  SYSTEMS  AND  THE  OBSERVED  HAZARD 

We  are  given  a  system  consisting  of  n  components  each  of  which  is  at  all  times  either 
working  or  failed.  In  addition,  we  suppose  that  whether  or  not  the  system  is  working  is  solely 
determined  as  a  function  function— call  it  0— of  the  component  states.  That  is,  letting  x,  equal 
1  or  0  according  to  whether  or  not  the  i,h  component  is  working,  then  we  suppose  that  there 
exists  a  nondecreasing  binary  function  0  such  that 

(1  if  system  works  under  state  vector  x 
0  otherwise 


Suppose  now  that  component  i  is  initially  working  and  will  work  for  a  random  time  having 
distribution  F,  at  which  time  it  will  fail,  /  =  1,  .  .  .  ,  n.  Once  a  component  has  failed,  it 
remains  failed  from  that  time  on.  Let  x,(.i)  equal  1  if  component  i  is  working  at  time  /  and  0 
otherwise  and  define  the  random  sets  C(t)  by 

C(z)  -  (/  :  0(1, .  *(/))  =  1,  0(0,.  *(/))  =  0) 

where  <t>(\ x)  =  ....  x,_|,  1.  x,+|,  ....  xn)  and  0(0,,  x)  —  0(*i,  ....  -v,_ t , 

0,.x,+| . xn).  In  words,  C(/)  is  the  set  of  critical  components  at  time  /,  where  a  component 

is  critical  at  some  time  if  its  failure  at  that  time  would  cause  the  system  to  go  from  working  to 
failed. 

If  we  let  L  denote  the  length  of  system  life,  then  assuming  independence  of  components, 
the  total  observed  hazard  experienced  by  the  system  during  its  lifetime  could  be  expressed  as 

c  ^ 

J  X  kt(t)dt  —  total  system  hazard 

IfC(t) 

where  \,(/)  is  the  (usual)  hazard  rate  function  of  component  /  at  time  /.  We  now  show  that 
this  random  quantity  is  exponentially  distributed  with  mean  1. 
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THEOREM:  The  total  system  hazard  is  an  exponential  random  variable  having  mean  1. 

PROOF:  The  proof  is  by  induction  on  n.  As  the  observed  hazard  rate  is  equal  to  the 
(usual)  component  hazard  rate  when  n  =  1.  the  result  foilows  in  this  case.  So  assume  the 
result  for  any  system  of  n  -  1  components  and  consider  an  n  component  system.  Say  a  com¬ 
ponent  is  a  1-component  minimal  cut  set  if  its  failure  guarantees  system  failure  even  if  all  other 
components  are  working.  We  consider  three  cases. 

CASE  1:  There  do  not  exist  any  1-component  minimal  cut  sets. 

In  this  case,  the  observed  hazard  rate  will  be  0  until  a  component  fails.  At  this  point,  the 
remaining  hazard  will  be  exponentially  distributed  with  mean  1  by  the  induction  hypothesis. 
Hence,  the  result  follows  in  this  case. 

CASE  2:  There  exists  exactly  one  1-component  minimal  cut  set-say  (1). 

In  this  case,  let  L\  denote  the  life  of  component  1  and  let  T  denote  the  first  time  any  of 
the  components  2  through  n  fail.  Now,  conditional  on  T  =  i,  the  total  observed  hazard  can  be 
expressed  as 

J0  ‘  Ki(s)dt  'f  J0  A i  ( s ) z/s  ^  JQ  Xi  (s)ds 

Jo  k](s)ds  +  Exp  (1)  if  Jo  '  ki(s)ds  >  JQ  k{(s)ds 

where  we  have  used  the  induction  hypothesis  in  writing  that  the  remaining  observed  hazard 
starting  at  time  T  =  rand  assuming  that  component  1  has  not  yet  failed  is  exponentially  distri¬ 
buted  with  mean  1.  Thus,  from  the  above,  we  see  that,  given  T  *  r,  the  total  observed  hazard 
has  the  same  distribution  as  the  random  variable  defined  by 

E\  if  £|  <  c 

c  +  Ei  if  E2  >  c 

where  c  is  a  constant  and  Eh  El  are  independent  exponential  random  variables  each  having 
mean  1.  Such  a  random  variable  is  easily  seen  to  be  also  exponential  with  mean  1. 

CASE  3:  There  exists  at  least  two  1-componenl  minimal  cut  sets— say  (1)  and  {2}. 

In  this  case,  we  can  combine  components  1  and  2  into  a  single  component  which  fails 
when  either  one  of  them  fails  and  the  result  follows  from  the  induction  hypothesis. 

REMARK:  The  above  proof  goes  through  in  an  identical  manner  even  when  the  com¬ 
ponent  lifetimes  are  dependent.  Of  course,  the  observed  hazard  rate  at  time  r  would  no  longer 
be  T  k,(t)  but  would  have  to  be  suitably  modified. 
itctn 

3.  SOME  FINAL  REMARKS  AND  A  HEURISTIC  ARGUMENT 

(i)  Whereas  we  have  only  established  that  the  total  observed  hazard  experienced  by  a 
system  is  exponentially  distributed  with  mean  1  for  the  rather  special  system 
described  in  Section  I,  we  believe  that  this  result  holds  with  tremendous  general¬ 
ity.  (Another  system  in  which  we  have  been  able  to  verify  it  is  when  events 


•  • 


•  • 


ft 


ft 


ft  •: 


ft  • 


ft  ft 


•  •  t 
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occur  in  accordance  with  some  arbitrary  point  process  and  each  event  has  a  ran¬ 
dom  nonnegative  damage  associated  with  it.  The  system  is  said  to  fail  the  first 
time  the  total  cumulative  damage  exceeds  some  specified  value.) 

(ii)  An  interesting  sidelight  about  the  system  of  Section  1  is  that  it  is  well-known  that 
if  all  component  life  distributions  are  IFR  (increasing  failure  rate)  in  the  sense 
that  A ,(/)  is  a  monotone  nondecreasing  function  for  ail  i  =  1,  .  .  .  ,  n,  then  it 
need  not  be  the  case  that  the  system  hazard  rate  is  also  increasing.  However,  it 
easily  follows  in  this  case  of  IFR  component  life  distributions  that  the  observed 
system  hazard  rate— namely,  Y  A, (/)  — increases  up  to  the  time  of  system 

ucti) 

failure. 


(iii) 


A  general  definition  of  the  random  hazard  can  be  given  along  the  following  lines: 
Let  (Fs,  0  <  s  <— oo)  denote  an  increasing  family  of  sigma  fields  and  let  X 
denote  a  stopping  time  defined  on  this  family.  Let 


Rit) 


P\t  <  X  <  t  +  h  I  F,) 

lim  - ; - 

a— *0  h 


where  we  assume  the  above  limit  exists  almost  surely.  We  now  claim  that,  when 
it  is  well  defined, 


J.jr 

R(t)dt  has  an  exponential  distribution  with  mean  1. 
o 

We  now  present  a  heuristic  argument  of  the  above. 


Heuristic  Argument  of  (2.1) 


Let  Hit )  —  J  R(s)ds.  We  wish  to  argue  that  HiX)  is  exponential  with  mean  1  and  to 
do  so,  we  shall  argue  that  its  failure  rate  function— call  it  A(s)— is  identically  1.  To  show  that 
A(j)  S  1,  let  us  condition  on  the  event  that  HiX)  and  on  the  values  of  Ts  and  RiTs) 
where  T,  is  defined  to  be  the  time  at  which  H  is  equal  to  s—  that  is,  HiTs )  —  s.  Now  given 
HiX)  >s,  Ts,  RiTs)-  As 

X  >  Ts  + e  with  probability  1  —  eAs  +  oie). 

Hence, 

HiX)  >  HiTs  +  «)with  probability  1  —  eAs  +  oie). 


But 


and  so 


HiTs  +  t)  -  HiTs)  +  «AS  +  oie) 
—  s  +  «Aj  +  oie) 


HiX)  >  s  +  «Aj  +  o(e)with  probability  1  -  eAs  +  o(e) 


which,  for  A,  >  0,  is  roughly  equivalent  to 

HiX)  >  s  +  6  with  probability  1  —  8  +  o(S). 
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Thus,  independent  of  Ts,  As,  given  that  H(X)  ^  s,  it  has  probability  8  +  0(8)  of  failing  during 
the  next  8  units  of  hazard.  Thus, 

P[H{X)  <  s  +  8|  H(X)  >  s}  -  8  +  0(8). 

Dividing  the  above  by  8  and  letting  8  go  to  0  "proves"  that  the  failure  rate  function  of  H(X)  is 
identically  one. 
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ABSTRACT 

In  previous  papers  [1],  [2]  the  authors  developed  a  maintenance  policy  for  a 
single  life-limited  part.  Using  an  opportunistic  replacement  approach  a  scheme 
was  devised  which  utilized  early  replacement  of  equipment  to  offset  more  costly 
future  expenditures.  This  paper  will  extend  the  results  to  the  multicomponent 
case.  Examples  are  given  illustrating  the  benefits  of  this  new  technique. 

INTRODUCTION 

Optimal  opportunistic  replacement  (OR)  and  maintenance  policies  for  single  component 
systems  have  been  extensively  discussed  {3],  [4],  (5).  In  previous  papers  [ll,  [2),  the  authors 
dealt  with  the  application  of  these  policies  to  life-limited  parts.  Specifically,  B  is  a  component 
of  a  system  with  the  following  properties: 

(1)  The  system  is  known  to  fail  exponentially  (not  due  to  B)  at  which  time  it  is 
brought  to  the  maintenance  base  at  a  cost  of  CT ;  disassembled  and  reassembled  (if 
necessary)  at  a  cost  of  CA  (cost  of  accessing);  and  repaired  according  to  its  mal¬ 
function. 

(2)  B  never  fails  but  has  a  life-limit  of  Y  hours.  After  Y  hours,  the  machine  must  be 
brought  to  the  maintenance  base  (forced  removal  —  FR)  at  a  cost  of  Cr,  disassem¬ 
bled  and  reassembled  at  a  cost  of  CA  (Cf  -  CT  +  CA)  and  part  B  replaced  at  a  cost 
of  Q. 

Compressor  units  on  jet  engines,  for  example,  contain  disks  that  satisfy  these  conditions.  The 
optimal  strategy  for  B  is  given  by  an  (A'j ,  X2 )  rule  such  that: 

(1)  If,  when  the  unit  fails,  B  is  accessible,  replace  B  if  it  has  less  than  X2  hours  of 
remaining  life. 

(2)  If,  when  the  unit  fails,  B  is  not  accessible,  replace  B  if  it  has  less  than  X\  hours  of 
remaining  life  ( A'i  ^  A"}). 


VOL  29.  NO  4,  DECEMBER  1982 


685 


naval  research  logistics  quarterly 


686  S.  EPSTEIN  AND  Y  WILAMOWSKY 

(3)  Under  all  conditions  replace  B  after  Y  hours  of  usage. 

and  X2  are  computed  by  minimizing 

Expected  (Cost)  _  E(C) 

Expected  (Life)  ~  £(£) 

where 

(1)  £(C)  -  CB  +  exp  | — ^  *l)P|(U  -  p)(l  -  e~*A)CA  +  e  *'UCF) 


(2) 

£(£)  -  Y  -  X2  +  A 

1  ,  \  (X2  -XX)P 

—  +  exp - - - 

1  -  i  - 

P  l  A 

'  1 

and 


A  —  Average  time  between  removals  (due  to  the  unit  failure) 
p  —  Probability  of  B  being  accessible  on  any  unit  breakdown. 

This  paper  will  extend  our  previous  results  to  a  multicomponent  system  (i.e.,  more  than 
one  life-limited  disk).  While  the  above-mentioned  policies  insure  optimal  utilization  of  indivi¬ 
dual  components  the  results  may  be  far  from  optimal  for  the  system  as  a  whole. 

EXAMPLE  1 :  Assume  two  identical  disks,  D\  and  D2,  with  the  following  parameters,  are 
inserted  simultaneously  into  a  jet  engine  compressor  unit  (a  unit  may  contain  as  many  as  10 
disks): 


p  -  1,  CB  -  $18,000,  A  -  5,000  hours,  CF  -  $7,800,  and  Y  —  30,000  hours. 


Applying  our  rule  on  an  individual  basis  yields  X\  -  0  and  X2  -  5,650.  The  cost  of  the  entire 
system  per  hour  is  thus,  from  (1)  and  (2) 


E(CDl)  £(C0j) 
E(LDi )  +  E(LDi) 


£(C°i)  18,000 

£U0l)  £UDj) 


20, 520  18,000 

27,735  27,735 


$1,389  per  hour. 


£(Q)2 )  "  18,000  because  (a)  p  —  1  cancels  the  CA  term;  and  (b)  no  forced  removal  cost  CF  is 
assigned  to  this  disk  since  if  an  FR  does  occur  both  disks  will  be  removed  simultaneously,  caus¬ 
ing  only  one  CF  which  has  already  been  accounted  for  in  the  calculation  of  £(C0)).  The 
optimal  policy  for  the  system  is  in  fact  Aj  —  0  and  X2  -  3,800  and  costs 


21,648  18,000 

28,863  28,863 


$1,373  per  hour. 


Considering  that  mqjor  airlines  operate  hundreds  of  engines  with  thousands  of  disks,  a  0.8  cents 
savings  per  disk  per  hour  is  appreciable. 

The  solution  of  the  multicomponent  system  has  proven  to  be  elusive.  Various  enumera- 
tive  procedures  for  the  general  class  of  OR  problems  have  been  suggested,  (6),  [7).  Application 
of  these  techniques  to  real-life  situations,  however,  requires  simplifying  assumptions  that  are 
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unrealistic  and  often  result  in  considerably  suboptimal  policies.  Additionally,  computational 
requirements  for  these  techniques  are  quite  large.  Vergin  and  Scriabin  [7]  state  that,  on  an 
IBM  370/165,  the  two  disk  problem  required  0.5  minutes  and  the  three  disk  problem  13.3 
minutes.  Any  attempted  extension  to  an  n-disk  problem  ( n  >  3)  leads  to  prohibitively  high 
computer  processing  time  and  memory  requirements.  They  conclude  by  saying  that  "it  seems 
highly  unlikely  that  any  computationally  simple  optimizing  model  will  be  developed  for  the 
maintenance  of  very  complex  equipment." 

The  structure  of  the  life-limited  problem  and  the  technique  of  our  solution  in  the  single 
disk  case  offer  an  approach  not  previously  investigated.  It  involves  an  analytical  procedure  that 
facilitates  handling  of  even  the  //-disk  paradigm.  We  present  the  solution  to  the  two  disk  prob¬ 
lem.  The  generalization  to  //-disks  is  similar.  Illustrative  examples  are  provided. 

PROBLEM  FORMULATION 

Consider  two  disks,  1  and  2,  where  2  has  A  7"  more  hours  of  life  remaining  than  1  (Figure 
1).  We  will,  for  the  sake  of  a  more  lucid  development,  assume  throughout  this  paper  that  p  — 
1,  i.e.,  every  engine  maintenance  insures  accessibility  of  the  entire  unit.  The  generalization  for 
p  <  1  can  be  readily  derived.  The  following  notation  will  be  used: 

X,  —  The  removal  point  for  disk  /  (only  one  accessible  cut-off  point  per  disk  is 
necessary). 

CB  —  Cost  of  disk  /'. 

Y,  —  Life  limit  of  disk  /. 

E(C,)  —  Expected  cost  of  disk  i  in  a  one  disk  system. 

E(L,)  —  Expected  life  of  disk  /  in  a  one  disk  system. 

E(C 2/|)  —  Expected  cost  of  disk  2  in  a  system  which  also  contains  disk  1. 

E(L 2/\)  —  Expected  life  of  disk  2  in  a  system  which  also  contains  disk  1. 

P |  —  The  probability  that  disk  2  is  retained  when  disk  1  is  removed. 

£(C,)  and  £(£,)  can  be  found  by  substituting  the  desired  X,  into  the  simplified  forms  of  equa¬ 
tions  (1)  and  (2),  i.e., 

(3)  E(C,)  -  CB)  +  e~XjKCF 

(4)  E(L,)  -  Y,  —  X,  +  A(1  —  e~X,/K). 


Any  time  disk  1  arrives  with  less  than  X{  hours  of  remaining  life  it  will  be  removed  and 
disk  2  will  be  checked  with  respect  to  X2-  If  1,  however,  exceeds  T|,  disk  2  under  all  cir¬ 
cumstances  will  remain.  The  reasoning  is  simple,  An  FR  attributable  to  2  is  impossible  if  !  is 
still  operational.  In  the  worst  case,  1  will  lead  to  an  FR  and  make  2  accessible  at  that  time. 
Since  removal  of  a  disk  is  justifiable  only  as  a  deterrent  to  a  possible  FR,  a  decision  to  retain  1 
must  automatically  retain  2  as  well.  A  flowchart  of  our  policy  is  given  in  Figure  2. 
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Pi «k  ?}<  at  ^ 

jDisk  1 

•Remaining  Life 


Figure  1.  Two  disk  paradigm 


ANALYSIS 

The  relevant  factors  are  summarized  in  the  following  series  of  statements: 

STATEMENT  I:  The  cost  per  hour  of  the  system  is  equal  to  the  sum  of  the  costs  per 
hour  of  the  individual  disks. 
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STATEMENT  2:  The  objective  in  the  2-disk  problem  is  to  minimize 
m  g(C,>  ,  EiCy ,) 

£(£,)  £(£2/!) 

PROOF:  The  cost  and  life  of  disk  1  is  a  function  of  X{  alone.  The  cost  and  life  of  disk  2, 
however,  as  we  have  previously  explained  is  dependent  on  both  X\  and  X2. 

STATEMENT  3:  The  optima!  X2  may  be  derived  by  the  single  disk  approach,  i.e.,  find 
the  X2  that  minimizes  £(C2)/ £(£2)  of  (3)  and  (4). 

PROOF:  Since  X2  is  used  only  if  disk  1  has  been  removed,  the  statement  is  true  by 
dynamic  programming. 

STATEMENT  4: 

(6)  E(CVI)  =  P\E(C2)  -Ml  -  Pl)CB2 

PROOF:  If  disk  2  remains  after  disk  1  has  been  removed,  then  it  may  generate  an  FR. 
Its  expected  cost  is  therefore  given  by  E(C2)  of  (3).  Disk  2  can  never  cause  a  forced  removal 
if  it  is  removed  together  with  disk  1.  Its  expected  cost  in  this  situation  is  simply  its  original 
cost. 


STATEMENT  5:  The  expected  life  of  disk  2  given  that  it  is  removed  together  with  disk  1 
is: 

(7)  Y2  -  Min  (X2.  Xi  +  A  T)  +  A(1  -  e-Min(*J-*r-*i)/x). 

PROOF:  (a)  If  X2  ^  Xx  +  AT  then  whenever  disk  1  is  below  its  cut-off  value  disk  2  is 
also  below  its  cut-off  point.  Thus,  the  two  disks  will  always  be  removed  together.  The  double 
removal  will  occur  due  to  an  accessible  removal  when  disk  1  has  less  than  X\  hours  remaining 
(and  disk  2  has  less  than  X\  +  A 7),  or  an  FR  when  all  of  disk  l’s  permissible  life  has  been 
expended  (and  disk  2  has  ATof  remaining  life).  The  expected  value  is 

f  ( Y2  -  (A-!  +  4D+Jt)|  e~*/Kdx  +  ( Y2  -  A  T)  f  —  e~x/Kdx. 

(b)  If  X2  <  X\  +  AT  it  is  possible  that  disk  1  is  removed  and  disk  2  is  retained.  The 
knowledge  that  both  are  removed  simultaneously  insures  that  both  were  in  use  when  disk  2  had 
X2  hours  of  remaining  life,  or  equivalently  when  disk  1  had  X2  —  A  T  hours  of  life  remaining. 
Since  X2  —  AT  <  X\  the  next  breakdown,  if  it  occurs  prior  to  an  FR,  will  result  in  the  removal 
of  both  disks.  The  expected  life  is  therefore, 

J/J  Ar  (Y2  -  X2  +  x)  7-  e-*lKdx  +  (Y2-  AT)  f "  _  -  e~*tKdx. 

Simplifying  both  expressions  yields  (7). 

STATEMENT  6: 

(8)  £(I2/|)  -  P\E(L2)  +  (1  -  P\)(Y2  -  Min(T2,  X]  +  AT)  +  A (1  -  e_Mm<*2~*r*I’/x 


VOL.  29.  NO.  4,  DECEMBER  1982 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


690 


S  EPSTEIN  AND  Y.  WILAMOWSKY 


PROOF:  Direct  application  of  expected  value  and  (7). 
STATEMENT  7: 


(9) 


P\ 


u  if  jr,  +  A  T  <  X2 

1  if  X2  <  A  T 

I  _  e~(Xi+bT~x2)/K  otherwise 


PROOF:  In  the  first  case  the  disks  will  always  be  removed  together  while  in  the  second 
case  the  time  differential  is  so  great  that  the  disks  have  no  effect  on  each  other.  In  all  other 
cases  a  breakdown  with  A.T,  remaining  time  on  disk  1  (AXj  —  Xj  -  x,  where  x  is  time  to  first 
breakdown  after  disk  has  reached  its  critical  value)  means  that  disk  2  has  A  A'j  +  AT  time  left. 

T,  -  P( AT,  +  AT  >  X2) 


m,  -  x  +  AT  >  X2) 
P(x  <  Jr,  +  A T  -  X2) 


X 


y.+at-y. 


-  g-x/K 

A 


which  simplifies  to  (9). 

It  is  now  clear  that  the  objective  function  (5)  can  be  broken  down  into  a  function  of  X\ 
and  X2  using  equations  (6),  (8),  and  (9).  Since  X2  has  already  been  computed  in  the  standard 
manner,  the  only  remaining  variable  is  X\.  The  optimal  X\  can  thus  be  computed  by  using  a 
one  dimensional  search  on  the  objective  function. 

EXAMPLE  2:  Referring  to  Example  1  (AT=0),  an  X2  value  of  5,650  minimizes 
E(C2)/E(L2).  Inserting  this  value  into  (8)  and  (9)  the  X|  value  that  minimizes  the  objective 
function  (5)  is  3,800.  Note  that  Pt  for  this  combination  is  0(X|  +AT  <  X2),  guaranteeing 
that  both  disks  will  always  be  removed  simultaneously  and  that  X2  need  never  be  actually 
applied.  It  is  intuitively  obvious  and  can  be  easily  shown  that  one  disk  having  the  combined 
cost  of  disks  1  and  2  will  have  the  same  3,800  cut-off  value. 


EXAMPLE  3:  We  will  again  use  the  2  disks  of  Example  1  except  that  this  time  disk  1  is 
3,000  hours  older  than  disk  2,  i.e.,  AT  =  3,000.  As  before  X2  is  initially  computed  as  5,650 

and  an  Xi  of  4,250  will  minimize  (5)  at  a  cost  of  =  1.444  dollars  per  hour. 

28,613  26,754 

Because  the  2  disks  have  different  remaining  lives  this  system  costs  an  additional  7  cents  per 
hour  to  run  (1.444-1.373).  If  the  engine  is  in  for  repairs  when 

(a)  disk  1  has  more  than  4,250  hours  left  — leave  both  disks 

(b)  disk  1  has  less  than  4,250  but  more  than^,650  hours  remaining— remove  disk  1 
and  leave  disk  2 

(c)  disk  1  has  less  than  2,650  hours  left— replace  both  disks. 
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EXAMPLE  4:  Suppose  the  two  disks  differ  in  age  by  over  5,650  hours,  i.e.,  A  T  >  5,650 
■»  X2.  By  (9)  P\  =  1  and  everything  reduces  to  a  computation  of  the  individual  optimal  cut¬ 
off  points  for  each  disk,  i.e.,  Xx  =  *2  =  5,650.  The  system  costs  0.74  +  0.74  =  SI. 48  per 
hour.  We  can  conclude  that  as  the  disk  separation  time  increases  from  0  to  X2  the  minimal 
cost  per  hour  will  continuously  increase. 

CONCLUSIONS 

A  variation  of  this  procedure  was  used  in  a  simulation  run  of  actual  engine  configurations 
of  a  major  passenger  airline.  Computational  requirements  caused  no  problem  for  even  a 
seven-disk  system.  Additionally,  the  results  were  compared  to  various  alternate  approaches 
(e.g.  Markov  analysis)  and  found  to  be  superior  to  them  all. 

It  should  be  noted  that  certain  underlying  assumptions  are  inherent  in  our  analysis.  In  a 
2-disk  system  for  example,  computation  of  X2  is  necessary  for  the  computation  of  Xx  and  it  is 
assumed  if  disk  1  is  removed,  X2  will  be  our  criteria  for  judging  the  wisdom  of  removing  disk 
2.  This,  however,  depends  on  the  amount  of  life  in  disk  l’s  replacement.  If  the  new  disk  has 
less  life  remaining  than  disk  2,  then  by  previous  arguments  disk  2  will  be  retained  regardless  of 
X2.  If  the  new  disk,  on  the  other  hand,  has  more  remaining  life  than  disk  2,  then  disk  2 
becomes  disk  1  in  the  new  configuration.  As  such,  its  cut-off  point  is  not  the  previously  com¬ 
puted  X2  but  a  new  Xx.  In  order  for  our  procedure  to  yield  optimal  results  the  new  Xx  would 
have  to  equal  the  old  X2.  As  discussed  in  the  2-disk  system  if  A  T  >  X2,  disk  1  is  unaffected  by 
disk  2  and  can  be  treated  as  an  individual  disk.  Life  specifications  of  jet  engines  are  such  that  if 
disk  2  in  the  initial  system  was  close  enough  in  life  to  disk  1  to  influence  it,  then  the  new  disk 
which  replaces  disk  1  will  have  considerably  more  life  than  the  remaining  disk,  and  hence  not 
affect  it.  Under  these  conditions  the  use  of  X2  in  our  policy  is  justified. 

However,  what  if  we  are  dealing  with  a  situation  where  this  condition  is  not  met?  Engine 
build  policies  may  very  well  dictate  the  replacement  of  a  worn  out  disk  with  remaining  life  that 
is  comparable  to  the  retained  disk.  As  discussed  in  Example  4  such  a  policy  offers 
configurations  leading  to  lower  costs  per  hour.  However,  the  necessary  prerequisite  for 
guaranteed  optimality  is  missing.  Additionally,  when  removing  one  or  two  disks  the  time 
differential  between  the  disk  will  generally  change.  Hence,  although  we  minimize  the  present 
configuration  we  in  no  way  consider  the  effect  our  decision  will  have  on  future  utilization. 

Because  of  these  arguments  the  multidisk  extension  is  presented  not  as  the  optimal  stra¬ 
tegy  but  rather  as  a  heuristic.  The  complexity  of  the  problem  necessitates  the  use  of  a  more 
limited  objective.  Disregarding  future  contingencies,  we  restrict  ourselves  to  the  optimization 
of  currently  operating  disks  only.  The  application  of  a  logical  policy  in  the  present  will  hope¬ 
fully  lead  to  near  optimal  system  utilization  in  the  future  as  well.  The  simulation  runs  previ¬ 
ously  mentioned  indicated  that  our  policy  will  converge  on  uniform  configurations  in  a  rela¬ 
tively  short  period  of  time.  Pending  further  investigation,  the  authors  are  convinced  that,  at 
least  in  the  jet-engine  problem,  the  approach  is  sound  and  far  superior  to  any  other  presently 
employed. 
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ABSTRACT 

Let  r  be  a  finite  stopping  time  with  random  hazard  rate  function 
(\,:r  >  0).  We  prove  that  /<J  A,  dt  is  exponentially  distributed  with  mean  1. 


1.  INTRODUCTION 


In  their  paper  published  in  this  issue  (Naval  Research  Logistics  Quarterly,  29,  4,  1982), 
Brown-Ross  [1]  introduce  the  notion  of  a  "randomly  evolving  hazard  rate  function."  Perhaps 
the  simplest  case  of  such  a  function  is  known  in  reliability  theory  as  the  failure  rate  function. 
Let  X  denote  the  survival  time  of  some  item  and  suppose  X  has  distribution  function  F  and 
density  /  Then,  the  failure  rate  function  of  A'— call  it  A  (•)— is  defined  by: 


(1.1) 


kit) 


1  -  Fit) 


and  since  kit)  dt  =  PiX  Zit.t  +  dt]  \X  >  t)  one  can  interpret  kit)  as  the  instantaneous  pro¬ 
bability  density  that  an  item  of  age  t  will  fail.  It  is  well  known  that  the  total  hazard  experienced 
by  the  item,  i.e., 

(1.2)  f X kit)  dt 

J  0 

is  exponentially  distributed  with  unit  rate.  Now  suppose  that  we  are  able  to  observe  more  than 
just  whether  or  not  the  item  is  functioning  at  t  but  also  a  complete  description  of  the  "state"  of 
the  item  at  that  time.  Then  the  interpretation  of  k  (•)  becomes: 


(1.3)  k  (/)  dt-PiX€  it,t  +  dt]  I  complete  information  of  state  of  item  at  t). 

Such  a  quantity  is,  in  general,  a  random  variable  and  its  value  is  determined  by  the  state  of  the 
compon|nt  at  /  which  is  itself  a  random  variable.  Brown-Ross  [11  conjectured  that  even  in  this 
case  J  k  it)  dt  is  exponentially  distributed  with  unit  rate.  They  verified  the  conjecture  in 
numerous  cases  taken  from  well-known  models  in  statistics,  reliability  and  Markov  chains. 
However,  for  the  general  case  they  only  provided  a  heuristic  proof. 
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Using  recent  developments  in  the  martingale  theory  of  jump  processes,  we  shall  give  a 
precise  formulation  of  the  conjecture  and  prove  it. 

2.  THE  CONJECTURE  AND  ITS  PROOF 

Let  (fl,  F,  P)  be  a  probability  space  and  let  [H,:t  >  0}  be  a  complete,  right  continuous, 
increasing  family  of  sub -a-  fields.  A  stochastic  process  [X,:t  >  0}  is  said  to  be  adapted  to  [H,] 
if  for  each  t  >  0  X,  is  //, -measurable.  It  is  said  to  be  {//,}  predictable  if  there  exists  a  sequence 
{ AT/ " ’  r  >  0)  of  stochastic  processes  such  that  for  each  t  ^  0  lim  X,{n)  -  X,  and  for  each 

» W"’}  is  left  continuous  and  adapted  to  [H,]. 

Let  t  be  any  finite  stopping  time  defined  on  {//,},  i.e..  Pi 0  <  r  <°°)  =  1  and  for  each 
/  >  0,  {t  <  /}  belongs  to  Hr  Define  the  following  process: 

1  r  <  t 

^ ^  ”  0  otherwise 

clearly  /,  is  a  submartingale.  Hence,  by  the  Doob-Meyer  decomposition  theorem  there  exists  a 
unique  increasing  process,  call  it  [A, :t  ^  0},  such  that: 

(2.2)  (i)  ^0-O 

(ii)  A  is  right  continuous 

(iii)  A  is  {//,} -predictable 

and: 

(2.3)  M,  —  /,  —  A,  is  {//,}  martingale.  Now  suppose  that  [A,]  has  a  differentiable  sample 

path,  and  let  —  -7-  A,.  Then  the  stochastic  process  {\,:r  >  0}  is  the  random  hazard  rate  of 
at 

the  stopping  time  t.  To  see  this,  note  that  since  { M ,}  is  a  martingale,  we  have: 

ElM,+d,  -  M'\H,)  -  0 

hence 

EUt+di  ~  4l^/l  -  ElA,+a,  ~  A,\H,\ 
but 

Elll+dl  -  I,\H,]  -  P(t  €  (t,t  +  dt]\H,) 
and 

E[A,+dl  -  A,\H,)  =  X,  dt. 

Now  we  state  the  conjecture  as  a  theorem: 

THEOREM:  If  \A,}  is  differentiable  with  derivative  (X,J  then  AT  =  J*Q  X,  dt  has  an 
exponential  distribution  with  mean  I. 

The  proof  is  based  on  the  following  lemma: 
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LEMMA:  ElA?]  -  n\  n  -  0. 1.2.  ••• 

PROOF:  It  is  well  known  (consult  for  instancy  [3.  Ch.  18])  that  for  any  finite  {//,)- 
predictable  process  {  the  stochastic  integral  J*Q  fu  d  Mu  is  a  {//^-martingale.  In  partic¬ 

ular,  since  {A,}  is  predictable,  for  each  n  (n  =  0,1,2..) 

(2.4)  =  Jo'  a;  dMu  s  $’  a;  dlu  -  Jo'  < 
is  a  martingale. 

Thus:  £(flT(n)  1  ~  0 

(2.5)  £j/ofA^/u]  =  £j/or  A;xur/W] 

or  equivalently: 

f  Jn+l 

(2.6)  U”r)-Ah— 

[  n  + 1 

which  implies  that 

(2.7)  UT”]  =  »!  n  =  0,  1, 

This  completes  the  proof  of  the  Lemma. 

PROOF  OF  THE  THEOREM.  First  note  that  the  nth  moment  of  an  exponential  distribu¬ 
tion  with  mean  1  is  n\ 

It  is  well  known  [2.  p.228]  that  a  distribution  function  is  uniquely  determined  by  its 
moments  whenever  the  power  series: 

Z 

n  (2 n) ! 

converges  in  some  interval.  This  clearly  holds  for  the  expohential  distribution  and  thus  Ar  has 
an  exponential  distribution  with  mean  1.  This  completes  the  proof  of  the  theorem. 

REMARK:  Note  that  a  sufficient  condition  for  Ar  to  have  an  exponential  distribution 

with  mean  I  is  that  the  sample  path  of  A  is  continuous,  (2.4)  becomes  S'  A"  dlu  -  s:  A”  dAu 
which  for  each  sample  path  of  A  is  merely  a  difference  between  two  Stieltjes  integrals  and  thus 
(2.6)  and  (2.7)  remain  valid 
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